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Introduction 



It is easy to define modular functions and forms, but less easy to say why they are important, 
especially to number theorists. Thus I shall begin with a rather long overview of the subject. 

Riemann surfaces 

Let X be a connected Hausdorff topological space. A coordinate neighbourhood of P e Z 
is a pair (C/,z) with U an open neighbourhood of P and z a homeomorphism of U onto 
an open subset of the complex plane. A compatible family of coordinate neighbourhoods 
covering X defines a complex structure on X. A Riemann surface is a connected Hausdorff 
topological space together with a complex structure. 

For example, every connected open subset X of C is a Riemann surface, and the unit 
sphere can be given a complex structure with two coordinate neighbourhoods, namely the 
complements of the north and south poles mapped onto the complex plane in the standard 
way. With this complex structure it is called \h& Riemann sphere. We shall see that a torus 
M^/Z^ can be given infinitely many different complex structures. 

Let Z be a Riemann surface, and let V be an open subset of X. A function /: F ^ C 
is said to be holomorphic if, for all coordinate neighbourhoods {U,z) of X, f oz~^ is a 
holomorphic function on z(U). Similarly, one can define the notion of a meromorphic 
function on a Riemann surface. 

The general problem 

We can state the following grandiose problem: study all holomorphic functions on all 
Riemann surfaces. M order to do this, we would first have to find all Riemann surfaces. This 
problem is easier than it looks. 

Let X be a Riemann surface. From topology, we know that there is a simply connected 
topological space X (the universal covering space of X) and a map p:X X which is a 
local homeomorphism. There is a unique complex structure on X for which p:X X isa 
local isomorphism of Riemann surfaces. If F is the group of covering transformations of 
p:X ^X,ihQnX = r\X. 

Theorem 0. 1 Every simply connected Riemann surface is isomorphic to exactly one of 
the following three: 

(a) the Riemann sphere; 

(b) C; 

def 

(c) the open unit disk D = {z e C \ \z\ < 1}. 

Proof. This is the famous Uniformization Theorem, proved independently by Koebe and 
Poincare in 1907. See Cartan 1963, VI, 4.7, or mol0516. □ 

The main focus of this course will be on Riemann surfaces with D as their universal 
covering space, but we shall also need to look at those with C as their universal covering 
space. 
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Riemann surfaces that are quotients of D 



In fact, rather than working with D , it will be more convenient to work with the complex 
upper half plane: 

M = {z e C I 3(z) > 0}. 

The map z |^ is an isomorphism of H onto D (in the language the complex analysts 
use, H and D are conformally equivalent). We want to study Riemann surfaces of the form 
F\EI, where F is a discrete group acting on H. How do we find such /^'s? There is an 
obvious big group acting on H, namely, SL2(M). For a = (" ^) e SL2(M) and z € H, let 

az + b 

a(.z) = — . 

cz + a 

Then 



z + h){cz + d)\ ^s(adz + hcz) 



\cz + d\^ J \cz + d\^ 
But ^s(adz + bc'z) = (ad — be) • S(z), which equals 3(z) because det(a) = 1. Hence 

S(a(z)) = ^s(z)/\cz + d\^ 

for a e SL2(M). In particular, 

zsM ^ a(z) eM. 

The matrix — / acts trivially on on H, and later we shall see that SL2(M)/{ib/} is the full 
group Aut(]HI) of bi-holomorphic automorphisms of H (Proposition 2.1). The most obvious 
discrete subgroup of SL2(M) is F = SL2(Z). This is called the full modular group. For 
any A^, we define 



r{N) 



a b 
c d 



a = l,b = 0,c = 0,d = Imod N, 



and call it the principal congruence subgroup of level A'^. There are lots of other discrete 
subgroups of SL2(M), but the main ones of interest to number theorists are the subgroups of 
SL2(Z) containing a principal congruence subgroup. 

Let Y{N) = r(N)\M and endow it with the quotient topology. Let p:M^ Y(N) be 
the quotient map. There is a unique complex structure on Y{N) such that a function / on 
an open subset U of Y(N) is holomorphic if and only if f o p is, holomorphic on p^^{U). 
Thus f ^ f o p defines a one-to-one correspondence between holomorphic functions on 
U C Y{N) and holomorphic functions on p~^(U) invariant under r(N), i.e., such that 
g(yz) = g(z) for all y € r{N). 

The Riemann surface Y(N) is not compact, but there is a natural way of compactifying 
it by adding a finite number of points. The compact Riemann surface is denoted by X(N). 
For example, 7(1) is compactified by adding a single point. 



Modular functions. 

A modular function f (z) of level A'^ is a meromorphic function on H invariant under r(N) 
and "meromorphic at the cusps". Because it is invariant under r(N), it can be regarded 
as a meromorphic function on Y{N), and the second condition means that it remains 
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meromorphic when considered as a function on X{N), i.e., it has at worst a pole at each 
point of X(Ar)^F(iV). 

In the case of the full modular group, it is easy to make explicit the condition "meromor- 
phic at the cusps" (in this case, cusp). To be invariant under the full modular group means 
that 

H^) = ^<-')'-"(" 

Since (J 1) ^ SL2(Z), we have that /(z + 1) = f(z). The function z e^""'^ defines an 
isomorphism C/Z ^ C \ {0}, and so every function satisfying this condition can be written 
in the form / (z) = f*(q), q = e'^^'^. As z ranges over the upper half plane, q(z) ranges 
over C \ {0}. To say that / (z) is meromorphic at the cusp means that f*(q) is meromorphic 
at 0, and so / has an expansion 

/(z)= ^ anq", q^e^^\ 
n>-No 

in some neighbourhood of 0. 
Modular forms. 

To construct a modular function, we have to construct a meromorphic function on H that is 
invariant under the action of r(N). This is difficult. It is easier to construct functions that 
transform in a certain way under the action of r(N); the quotient of two such functions of 
same type will then be a modular function. 

This is analogous to the following situation. Let 

F^(k) = (kxk ^origin)/ k"" 

and assume that k is infinite. Let k(X, Y) be the field of fractions of k[X, Y]. An element 
/ e k(X,Y) defines a function {a,b) f(a,b) on a subset of k xk, which will pass 
to the quotient F^(k) if and only if f(aX,aY) = fiX,Y) for all a € k"". Recall that a 
homogeneous form of degree J is a polynomial h(X, Y) e k[X, Y] such that h(aX,aY) = 
a'^hiX, Y) for all a e k^. Thus, to get an / satisfying our condition, we need only take the 
quotient g/h of two homogeneous forms of the same degree with h^O. 

The relation of homogeneous forms to rational functions on is exactly the same as 
the relation of modular forms to modular functions. 

Definition 0.2 A modular form of level N and weight 2k is a holomorphic fimction 

/ (z) on EI such that 

(a) f(az) = (cz + d f^ ■ f(z) for all a = (« ^) e r{Ny, 

(b) / (z) is "holomorphic at the cusps". 

For the full modular group, (a) again implies that /(z + 1) = /(z), and so / can be 
written as a function of ^ = g^^riz. condition (b) then says that this function is holomorphic 
at 0, so that 

/(z) = ^fl„^", q = e^^''. 
n>0 

The quotient of two modular forms of level N and the same weight is a modular function 
of level N. 
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Afline plane algebraic curves 

Let A: be a field. An affine plane algebraic curve C over k is defined by a nonzero polyno- 
mial f{X, Y) € k[X, Y]. The points of C with coordinates in a field ^ D ^ are the zeros 
of f{X, Y)mK^ K; we write C{K) for this set. Let k[C] = k[X, Y]/{f{X, Y)), and 
call it the ring of regular functions on C . When f{X, Y) is irreducible (this is the most 
interesting case so far as we are concerned), we write k{C) for the field of fractions of ^[C], 
and call it the field of rational functions on C. 

We say that / {X, Y) is nonsingular if /, |p- have no common zero in the algebraic 
closure ofk. A point where all three vanish is called a singular point on the curve. 

Example 0.3 Let C be the curve defined by = AX^ -aX-b, i.e., by the polynomial 

f(X, F) = 72 _ 4j^3 j^^x + b. 

Assume char k ^2. The partial derivatives of / are 2F and —12X^ + a = -jxi—^X^ + a). 
Thus a singular point on C is a pair (x , y) such that j = and is a repeated root of 
AX^ — aX — b. Therefore C is nonsingular if and only if the roots of AX^ — aX — b are all 
simple, which is true if and only if its discriminant A = a^— llb^ is nonzero. 

Proposition 0.4 Let C be a nonsingular affine plane algebraic curve over C; then C(C) 
has a natural structure as a Riemann surface. 

Proof. Let P be a point in C(C). If (df/dY)(P) / 0, then the implicit function theorem 
shows that the projection (x,y) 1-^ x:C(C) C defines a homeomorphism of an open 
neighbourhood of P onto an open neighbourhood of x(P) in C. This we take to be a 
coordinate neighbourhood of P. If (3//aF)(P) = 0, then (df/dX)(P) ^ 0, and we use 
the projection (x, j) j. □ 

Projective plane curves. 

A projective plane curve C over k is defined by a nonconstant homogeneous polynomial 
F(X,F,Z). Let 

F^(k) = (k^ ^ origin) /k"", 

and write (a : b : c) for the equivalence class of (a,b,c) in F-^(k). As F(X, Y, Z) is homoge- 
neous, F{cx,cy,cz) — c'" • F{x,y,z) for every c e k^, where m — deg{F(X, F, Z)). Thus 
it makes sense to say F(x,y,z) is zero or nonzero for (x:y:z)e F^(k). The points ofC 
with coordinates in a field K k are the zeros of F(X, Y, Z) in P^(A'). Write C(K) for 
this set. Let 

k[C] = k[X, Y, Z]/(F(X, F Z)), 

and call it the homogeneous coordinate ring of C. When F{X, Y, Z) is irreducible, so that 
/f [C] is an integral domain, we write k{C) for the subfield of the field of fractions of fc[C] 
of elements of degree zero (i.e., quotients of elements of the same degree), and we call it the 
field of rational functions on C. 

A plane projective curve C is the union of three affine curves Cx, Cy, Cz defined 
by the polynomials F(l, F, Z), F(X, 1, Z), F(X, Y, 1) respectively, and we say that C is 
nonsingular if all three affine curves are nonsingular. There is a natural complex structure on 
C(C), and the Riemann surface C(C) is compact. 
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Theorem 0.5 Every compact Riemann surface S is of the form C(C) for some nonsingular 
projective algebraic curve C , and C is uniquely determined up to isomorphism. Moreover, 
C(C) is the field of meromorphic functions on S. 

Unfortunately, C may not be a plane projective curve. The statement is far from being 
true for noncompact Riemann surfaces, for example, HI is not of the form C(C) for C an 
algebraic curve. See p24. 

Arithmetic of Modular Curves. 

The theorem shows that we can regard X(N) as an algebraic curve, defined by some 
homogeneous polynomial(s) with coefficients in C. The central fact underlying the arithmetic 
of the modular curves (and hence of modular functions and modular forms) is that this 
algebraic curve is defined, in a natural way, over Q[^7v], where = e)^p(2jTi / N), i.e., 
the polynomials defining X(N) (as an algebraic curve) can be taken to have coefficients in 
Q[^A^], and there is a natural way of doing this. 

This statement has as a consequence that it makes sense to speak of the points of X(N) 
with coordinates in any field containing Q[^7v]. In the remainder of the introduction, I shall 
explain what the points of 7(1) are in any field containing Q. 

Elliptic curves. 

An elliptic curve E over a field k (of characteristic 7^ 2, 3) is a plane projective curve given 
by an equation: 

Y^Z = 4X^-aXZ^-hZ^, A = a^-27P j^O. 

When we replace X with X/c'^ and Y with Y/c^, some c € k^, and multiply through by c^, 
the equation becomes 

Y^Z = iX^-ac'^XZ^-bc^Z^, 

and so we should not distinguish the curve defined by this equation from that defined by the 
first equation. Note that 

j(E) = 1728a 

is invariant under this change. In fact one can show (with a suitable definition of isomorphism) 
that two elliptic curves E and E' over an algebraically closed field are isomorphic if and 
only if j(E) = j(E'). 

Elliptic functions. 

What are the quotients of C? A lattice in C is a subset of the form 

A = + 1jO)2 

with coi and 0)% complex numbers that are linearly independent over M. The quotient C/yl is 
(topologically) a torus. Let C ^ C/yl be the quotient map. The space C/yl has a unique 
complex structure such that a function / on an open subset U of C/ 71 is holomorphic if and 
only if f o holomorphic on p~^(lJ). 
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To give a meromorphic function on C/vl we have to give a meromorphic function / on 
C invariant under the action of A, i.e., such that /(z + A) = / (z) for all A e yl. Define 

XeA,X^O ' ^ 

This is a meromorphic function on C, invariant under A, and the map 

[z] ^ (p(z) : p'(z) : 1):C/A ^ p2(C) 

defines an isomorphism of the Riemann surface C/ A onto the Riemann surface £■(€), where 
£■ is the elliptic curve, 



with 



See I §3. 



Elliptic curves and modular curves. 

We have a map A E(A) = C/A from lattices to elMptic curves. When is E(A) isomorphic 
to £(ylO? If A' = cA for some c e C, then 

[z]^[cz]:C/A^C/A' 

is an isomorphism, and in fact one can show 

E(A) ^ E(A') A' = cA, some ceC". 

By scaling with an element of C^, we can normalize our lattices so that they are of the form 

A{r) = Z-\ + Z-r,somet eU. 

Note that A(r) = A(r') if and only if there is a matrix ^) e SLaCZ) such that t' = 
Thus we have a map 

T £'(r): BI ^ {elliptic curves over C}/ss, 

and the above remarks show that it gives an injection 

r(l)\EI {elUptic curves over C}/«i . 

One shows that the function r i-^ j{E{x)):'M.^ C is holomorphic, and has only a simple 
pole at the cusp; in fact 

(t) = + 744 + 196884? + 21493760?^ + • • • , q = e^'''\ 

It is therefore a modular function for the full modular group. One shows further that it 
defines an isomorphism j : Y(l) C. The surjectivity of j implies that every elliptic curve 
over C is isomorphic to one of the form E(r), some r € H. Therefore 

r(l)\H ( S {elliptic curves over C}/f^ . 
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There is a unique algebraic curve Y(1)q over Q that becomes equal to F(l) over C and has 
the property that its points with coordinates in any subfield L of C are given by 

F(1)q(L) = {elliptic curves over L}/~ 

where E ~ E' if E and E' become isomorphic over the algebraic closure of L. More 
precisely, for all subfields L of C, there is a commutative diagram: 

F(1)(C) < > {elliptic curves over C}/«i 

F(1)q(L) < ^'^ > {elUptic curves over 

in which the vertical arrows are the natural inclusions. 

From this, one sees that arithmetic facts about elliptic curves correspond to arithmetic 
facts about special values of modular functions and modular forms. For example, let E be 
an elliptic curve over an algebraic number field L; when regarded as an elliptic curve over 
C, E is isomorphic to E{x) for some r € C, and we deduce that 

j{x) = j{E{x)) = j{E)eL; 

the transcendental function j takes a value at t which is algebraic! Moreover, the point [t] 
on y(l)(C) is in the image of Y{\)q{L) 7(1)(C). 

For example, if Z + Zr is the ring of integers in an imaginary quadratic field K, one can 
prove in this fashion that, not only is j{x) algebraic, but it generates the Hilbert class field of 
K (largest abeUan extension of K unramified over K at all primes of K). 
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Chapter 



The Analytic Theory 



In this chapter, we develop the theory of modular functions and modular forms, and the 
Riemann surfaces on which they live. 

1 Preliminaries 

In this section we review some definitions and results concerning continuous group actions 
and Riemann surfaces. Following Bourbaki, we require (locally) compact spaces to be 
Hausdorff. Recall that a topological space X is locally compact if every point in X has a 
compact neighbourhood; then every compact subset of X has a compact neighbourhood.' 
We often use [x] to denote the equivalence class containing x. 

Continuous group actions. 

Recall that a group G with a topology is a topological group if the maps 

ig,g')^gg':GxG^G, g^g-':G^G 

are continuous. Let G be a topological group and let X be a topological space. An action of 
Gon X, 

(g,x) gx:G X X ^ X, 

is continuous if this map is continuous. Then, for each g € G, x i-^ gx:X X is a 
homeomorphism (with inverse x i-^ g~^x). An orbit under the action is the set Gx of 
translates of an x € X. The stabilizer of x e X (or the isotropy group at x) is 

Stab(x) = {g e G \ gx = x). 

If X is Hausdorff, then Stab(x) is closed because it is the inverse image of x under the 
continuous map g gx:G X . There is a bijection 

G/ Stab(x) ^ Gx , g- Stab(x) gx; 

'Let ^ be a compact subset of X. For each a & A, there exists an open neighbourhood Ua of a in Z whose 
closure is compact. Because A is compact, it is covered by a finite family of J/q's, and the union of the closures 
of the Ua 's in the family will be a compact neighbourhood of A. 
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I. The Analytic Theory 



in particular, when G acts transitively on X, there is a bijection 

G/ Stab(x) X. 

Let G\X be the set of orbits for the action of G on X, and endow G\X with the quo- 
tient topology. This is the finest topology for which the map p:X G\X, x Gx, is 
continuous, and so a subset U of G\X is open if and only if the union of the orbits in U 
is an open subset of X. Note that 77 is an open map: if ?7 is an open subset of X, then 
(p(U)) = Ug^eG SU, which is clearly open. 

Let H he a subgroup of G. Then H acts on G on the left and on the right, and H\G 
and G/H wee the spaces of right and left cosets. 

Lemma 1 . 1 The space G/ H is Hausdorifif and only if H is closed in G. 

Proof. Write p for the map G G/H, g gH. If G/H is Hausdorff, then eH is a 
closed point of G/H, and so H = p~^(eH) is closed (here e is the identity element of G). 

Conversely, suppose that // is a closed subgroup, and let aH and bH be distinct elements 
of G/H. Since G is a topological group, the map 

f-.GxG^G, ig,g')^g-'g', 

is continuous, and so f~^(H) is closed. As aH ^ hH, (a,h) ^ f~^{H), and so there is 
an open neighbourhood of {a,b), which we can take to be of the form J7 x F, that is disjoint 
from /-I (^)- Now the images of U and K in G/// are disjoint open neighbourhoods of 
aH and bH. □ 

As we noted above, when G acts transitively on X, there is a bijection G/ Stab(x) X 
for any x e X. Under some mild hypotheses, this will be a homeomorphism. 

Proposition 1 .2 Suppose that G acts continuously and transitively on X. If G and X are 
locally compact and Hausdorff, and there is a countable base for the topology of G, then the 
map 

[g] ^gx:G/ Stabix) X 

is a homeomorphism. 

Proof. We know the map is a bijection, and it is obvious from the definitions that it is 
continuous, and so we only have to show that it is open. Let U be an open subset of G, and 
let g e t/; we have to show that gx is an interior point ofUx. 

Consider the map G xG ^ G, {h,h')\-^ ghh' . It is continuous and maps {e, e) into U, 
and so there is a neighbourhood V of e, which we can take to be compact, such that F x F is 
mapped into U ; thus gF^ C U. After replacing F with F n F~\ we can assume = F. 
(Here V'^ = {h'^ \heV};V^ = {hh' \h,h' e F}.) 

As e eV, G = [JgV (union over g e G). Each set gV is a union of open sets in the 
countable base, and we only need to take enough g's in order to get each basic open set 
contained in a gF at least once. Therefore, there is a countable set of elements gi , g2, . . . e G 
such that G — [J gnV. 

As gn V is compact, its image gnVx'mX is compact, and as X is Hausdorff, this implies 
that gnVx is closed. The following lemma shows that at least one of the g„Fx's has an 
interior point. But y gnJ'.X ^ X is a homeomorphism mapping Fx onto g„ Fx, and so 
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Vx has interior point, i.e., there is a point hx € Vx and an open subset W oi X such that 
hxeW c Vx. Now 

gx = gh"^ ■ hx e gh~^ W c gV^x c Ux 

which shows that gx is an interior point ofUx. □ 

Lemma 1 . 3 (B aire' s Theorem) If a nonempty locally compact (hence Hausdorff) space 
X is a countable union X = UneN closed subsets F„, then at least one of the V„ has 
an interior point. 

Proof. Suppose no V„ has an interior point. Take Ui to be any nonempty open subset 
of X whose closure Ui is compact. As Vi has empty interior, Ui is not contained in 
Vi. Now Fi n C/i is proper compact subset of the locally compact space Ui, and so there 
exists a nonempty open subset U2 of Ui such that U2 (Z Ui ^ Ui Ci Vi. Similarly, U2 
is not contained in V2, and so there exists a nonempty open subset of U2 such that 
C/3 C i72 ^ n V2. Continuing in this fashion, we obtain nonempty open sets C/3, U4 ... 
such that U„+i C U„^U„r\Vn. The Un form a decreasing sequence of nonempty compact 
sets, and so (~]Un ^ 0, which contradicts X = [JVn. □ 

Riemann surfaces: classical approach 

Let X be a connected Hausdorff topological space. A coordinate neighbourhood for X is 
pair (U, z) with U an open subset of X and z a homeomorphism of U onto an open subset of 
the complex plane C. Two coordinate neighbourhoods (Ui , z/) and (Uj , zj) are compatible 
if the function 

zi o zj': ZJ (Ui nUj)^ zi (Ui n Uj) 

is holomorphic with nowhere vanishing derivative (the condition is vacuous if U fl Uj — 0). 
A family of coordinate neighbourhoods (Ui , Zi)iei is ^coordinate covering ifX = \J Ui and 
(Ui , Zi) is compatible with (Uj , zj) for all pairs (i,j)elxl. Two coordinate coverings are 
said to be equivalent if their union is also a coordinate covering. This defines an equivalence 
relation on the set of coordinate coverings, and we call an equivalence class of coordinate 
coverings a complex structure on X. A space X together with a complex structure is a 
Riemann surface. 

Let U = (Ui , Zi)iei be a coordinate covering of X. A function f:U ^ C on an open 
subset U of X is said to be holomorphic relative to U if 

foz-^:ziiUnUi)^C 

is holomorphic for all i e I . When W is an equivalent coordinate covering, / is holomorphic 
relative to U if and only if it is holomorphic relative to W, and so it makes sense to say that 
/ is holomorphic relative to a complex structure on X: a function f:U ^ C on an open 
subset C/ of a Riemann surface X is holomorphic if it is holomorphic relative to one (hence 
every) coordinate covering defining the complex structure on X. 

Recall that a meromorphic function on an open subset U of C is a holomorphic function 
/ on the complement U ^ B of some discrete subset SofU that has at worst a pole at each 
point of E (i.e., for each a € 3, there exists an m such that (z — a)'" f (z) is holomorphic in 
some neighbourhood of a). A meromorphic function on an open subset U of a. Riemann 
surface is defined in exactly the same way. 
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Example 1 .4 Every open subset U of C is a Riemann surface with a single coordinate 
neighbourhood {U itself, with the inclusion z:U '-^ C). The holomorphic and meromorphic 
functions on U with this structure of a Riemann surface are just the usual holomorphic and 
meromorphic functions. 

Example 1.5 Let X be the unit sphere 

X:x2 + j2 + z2 = 1 

in M^. Stereographic projection from the north pole P = (0, 0, 1) gives a map 

X + iv 
l — z 

Take this to be a coordinate neighbourhood for X. Similarly, stereographic projection 
from the south pole S gives a second coordinate neighbourhood. These two coordinate 
neighbourhoods define a complex structure on X, and X together with this complex structure 
is called the Riemann sphere. 

Example 1.6 Let Z be the torus M^/Z^. We shall see that there are infinitely many 
nonisomorphic complex structures on X. 

A map f:X—>-X' from one Riemann surface to a second is holomorphic if for each 
point P of X, there are coordinate neighbourhoods (U,z) of P and (U',z') of f(P) such 
that z' o f o : z (U) z (U') is holomorphic. An isomorphism of Riemann surfaces is a 
bijective holomorphic map whose inverse is also holomorphic. 

Riemann surfaces as ringed spaces 

Fix a field k. Let X be a topological space, and suppose that for each open subset U of X, 
we are given a set 0(U) of functions U ^ k. Then O is called a sheaf of k-algebras on X 
if 

(a) f,ge 0(U) =^ f±g, fgeO(U);ihe function x ^ 1 is in 0(U) if [/ ^ 0; 

(b) / € 0(U), V cU^ f\Ve 0(Vy, 

(c) let J7 — [J Ui be an open covering of an open subset U of X, and for each i, let 
fi e O ([//); if fi I Ui n Uj = fj \ Ui n Uj for all /, ; , then there exists an / e 

such that f\Ui — fi for all / . 

When Y is an open subset of X, we obtain a sheaf of A: -algebras 0\Y on Y by re- 
stricting the map U 0(U) to the open subsets of Y, i.e., for all open U cY,v/e define 
(0\Y)(U) = 0(U). 

From now on, by a ringed space we shall mean a pair (X, Ox) with X a topological 
space and Ox a sheaf of C-algebras — we often omit the subscript on O. A morphism 
(p: (X, Ox) (X', Ox') of ringed spaces is a continuous map (p:X ^ X' such that, for all 
open subsets U' of X', 

f e Ox'{U') ^foipe Ox{9~\u')). 

An isomorphism (p: {X, Ox) {X' , Ox') of ringed spaces is a homeomorphism such that 
(p and (p~^ are morphisms. Thus a homeomorphism (p: Z ^ X' is an isomorphism of ringed 
spaces if, for every U open in X with image U' in X' , the map 

f^fo<p:Ox'{U')^Ox{U) 



1. Preliminaries 



17 



is bijective. 

For example, on any open subset V of the complex plane C, there is a sheaf Oy with 

Ov(U) = { holomorphic functions f:U ^ C}, 

all open U C V. We call such a pair (V, Oy) a standard ringed space. 

The following statements (concerning a connected Hausdorff topological space X) are 
all easy to prove. 

1 .7 Let U = (Ui , Zi) be a coordinate covering of X, and, for any open subset U of C, let 
OiU) be the set of functions f:U^C that are holomorphic relative to U. Then U 0{U) 
is a sheaf of C-algebras on X. 

1.8 Let U and U' be coordinate coverings of X; then U and U' are equivalent if and only if 
they define the same sheaves of holomorphic functions. 

Thus, a complex structure on X defines a sheaf of C-algebras on X, and the sheaf 
uniquely determines the complex structure. 

1.9 A sheaf Ox of C-algebras on X arises from a complex structure if and only if it 
satisfies the following condition: 

(*) there exists an open covering X = \jUi of X such that every ringed space 
{Ui , Ox I Ui) is isomorphic to a standard ringed space. 

Thus to give a complex structure on X is the same as giving a sheaf of C-algebras satisfying 
(*). 

Example 1.10 Let w e Z act on C as z i-^ z -|- n. Topologically, C/Z is cylinder. We 
can give it a complex structure as follows: let />:C ^ C/Z be the quotient map; for any 
point P e C/Z, choose a Q e /~^(P); there exist neighbourhoods U of P and V of 
Q such that is a homeomorphism V ^ U; take any such pair (U,p~^:U V) to be 
a coordinate neighbourhood. The corresponding sheaf of holomorphic functions has the 
following description: for every open subset U of C/Z, a function f:U ^ C is holomorphic 
if and only if / o is holomorphic (check!). Thus the holomorphic functions / onU C C/Z 
can be identified with the holomorphic functions on (U) invariant under the action of Z, 
i.e., such that / (z + «) = / (z) for all « e Z (it suffices to check that f{z + l) = f (z), as 1 
generates Z as an abelian group). 

For example, q(z) = g^jr/z defines a holomorphic function on C/Z. It gives an isomor- 
phism C/Z ^ C^ (complex plane with the origin removed) — in fact, this is an isomorphism 
both of Riemann surfaces and of topological groups. The inverse function C^ C/Z is (by 
definition) (2ni)^^ - log. 

Before Riemann (and, unfortunately, also after), mathematicians considered functions 
only on open subsets of the complex plane C. Thus they were forced to talk about "multi- 
valued functions" and functions "holomorphic at points at infinity". This works reasonably 
well for functions of one variable, but collapses into total confusion in several variables. 
Riemann recognized that the functions were defined in a natural way on spaces that were only 
locally isomorphic to open subsets of C, that is, on Riemann surfaces, and emphasized the 
importance of studying these spaces. In this course we follow Riemann — it may have been 
more natural to call the course "Elliptic Modular Curves" rather than "Modular Functions 
and Modular Forms". 
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Differential forms. 

We adopt a naive approach to differential forms on Riemann surfaces. 

A differential form on an open subset U of C is an expression of the form f(z)dz 
where / is a meromorphic function on U. With every meromorphic function / (z) on [/, we 
associate the differential form df = ^dz. Let w.U U' he & mapping from U to another 
open subset U' of C; we can write it z' = w(z). Let (v = f (z')dz' be a differential form on 
U'. Then w*(co) is the differential form f(w(z))^^dz on U. 

Let X be a Riemann surface, and let (Ui,Zi) be a coordinate covering of X. To give 
a differential form on X is to give differential forms coi = f (zi)dzi on (Uj) for each / 
that agree on overlaps in the following sense: let z,- = Wij (zj), so that Wjj is the conformal 
mapping z,- ° z J ^ : (Ui r\Uj) zi {Ui r\Uj); then w*- (a)i ) = u)j, i.e., 

fj (^j)dzj = fi {wij izj)) ■ w'ij (Zj)dzj . 

Contrast this with functions: to give a meromorphic function / on A' is to give meromorphic 
functions fi (z,) on z, (Ui) for each / that agree on overlaps in the sense that 

/; {zj) = fi [Wij (z j )) on z j (Ui nUj). 

A differential form is said to be of the first kind (or holomorphic) if it has no poles on 
X, of the second kind if it has residue at each point of X where it has a pole, and of the 
third kind if it is not of the second kind. 

Example 1.11 The Riemann sphere S can be thought of as the set of lines through the 
origin in C^. Thus a point on S is determined by a point (other than the origin) on the hne. 
In this way, the Riemann sphere is identified with 

pi(C) = (CxC-^{(0,0)})/C^. 

We write (xo:xi) for the equivalence class of (xo,xi); thus (xo'.xi) — (cxq-.cxi) for c 7^ 0. 

Let Uo be the subset where xq ^ 0; then zq: (xq ■ xi) xi/xq is a homeomorphism 
Uq — ^ C. Similarly, if J7i is the set where xi j^O, then zi: (xq : xi) xq/xi is a homeomor- 
phism Ui C. The pair {(Uo,zo), (Ui,zi)} is a coordinate covering of S. Note that zq and 
zi are both defined onUoCiUi, and zi = Zq ^; in fact, zo{Uo CiUi) = C\{0} = zi(Uo r\Ui) 
and the map w;oi:zi(J7o n Ui) zo(C/o H U\) is z z"^. 

A meromorphic function on 5* is defined by a meromorphic function /o(zo) of zq e C 
and a meromorphic function /i (zi) of zi e C such that for zqZi ^ 0, /i (zi) = fo(z^^). In 
other words, it is defined by a meromorphic function /(z)(= /i(zi)) such that /(z~^)is 
also meromorphic on C. (It is automatically meromorphic on C \ {0}.) The meromorphic 
functions on S are exactly the rational functions of z, i.e., the functions P(z)/ Q(z), P,Q e 
C[X], Q^O (see 1.14 below). 

A meromorphic differential form on S is defined by a differential form fo(zo)dzo on C 
and a differential form /i (zi)dzi on C, such that 

f,(z,) = fo(z^')~ for z,^0. 
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Analysis on compact Riemann surfaces. 

We merely sketch what we need. For details, see for example Gunning 1966.^ Note that 
a Riemann surface X (considered as a topological space) is oiientable: each open subset 
of the complex plane has a natural orientation; hence each coordinate neighbourhood of X 
has a natural orientation, and these agree on overlaps because conformal mappings preserve 
orientation. Also note that a holomorphic mapping f:X—^S (the Riemann sphere) can be 
regarded as a meromorphic function on X, and that all meromorphic functions are of this 
form. The only functions holomorphic on the whole of a compact Riemann surface are the 
constant functions. 

Proposition 1.12 (a) A meromorphic function f on a compact Riemann surface has 
the same number of poles as it has zeros (counting multiplicities). 

(b) Let (V be a differential form on a compact Riemann surface; then the sum of the 
residues of co at its poles is zero. 

Sketch of proof. We first prove (b). Recall that if co = fdz is a differential form on an 
open subset of C and C is any closed path in C not passing through any poles of /, then 



(sum over the poles p enclosed by C). Fix a finite coordinate covering (Ui , z, ), = i^...^„ of the 
Riemann surface, and choose a triangulation of the Riemann surface such that each triangle 
is completely enclosed in some Ut; then 2ni(Y^ Respo;) is the sum of the integrals of u> 
over the various paths, but these cancel out. 

Statement (a) is just the special case of (b) in which co = df / f . □ 

When we apply (a) to / — c, c some fixed number, we obtain the following result. 

Corollary 1.13 Let f be a nonconstant meromorphic function on a compact Riemann 
surface X. Then there is an integer n > such that f takes each value exactly n times 
(counting multiplicities ). 

Proof. The number n is equal to the number of poles of / (counting multiplicities). □ 

The integer n is called the valence of /. A constant function is said to have valence 
0. If / has valence n, then it defines a function X ^ S (Riemann sphere) which is n to 1 
(counting multiplicities). In fact, there will be only finitely many ramification points, i.e., 
point P such that (P) has fewer than n distinct points. 

Proposition 1.14 Lets be the Riemann sphere. The meromorphic functions are precisely 
the rational functions of z, i.e., the field of meromorphic functions on S is C(z). 

Proof. Let g(z) be a meromorphic function on S. After possibly replacing g(z) with 
g{z — c), we may suppose that g(z) has neither a zero nor a pole at oo (= north pole). 

^Gunning, R. C, Lectures on Riemann surfaces. Princeton Mathematical Notes Princeton University Press, 
Princeton, N.J. 1966. 
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Suppose that g{z) has a pole of order m, at / = l,...,r, a zero of order rii at qt, 
i = l,...,s, and no other poles or zero. The function 



has no zeros or poles at a point P ^ oo, and it has no zero or pole at oo because (see 1.12) 
=: It is therefore constant, and so 

g(z) = constant X 



Remark 1.15 The proposition shows that the meromorphic functions on S are all algebraic: 
they are just quotients of polynomials. Thus the field M.{S) of meromorphic functions on 
S is equal to the field of rational functions on as defined by algebraic geometry. This is 
dramatically different from what is true for meromorphic functions on the complex plane. 
In fact, there exists a vast array of holomorphic functions on C — see Ahlfors, Complex 
Analysis, 1953, IV 3.3 for a classification of them. 

Proposition 1.16 Let / be a nonconstant meromorphic function with valence n on a 
compact Riemann surface X . Then every meromorphic function g on X is a root of a 
polynomial of degree < n with coefficients in C(/). 

Sketch of proof. Regard / as a mapping X ^ S (Riemann sphere) and let c be a point 
of S such that /"^{c) has exactly n elements {Pi{c), P„(c)}. Let z s X he such that 
/(z) = c; then 

= n(^(^) - SiPi (c))) = (z) + n ic)g"- \z) + - + rn (c) 
i 

where the r, (c) are symmetric polynomials in the g{Pi (c)). When we let c vary (avoiding 
the c for which f(z) — c has multiple zeros), each r, (c) becomes a meromorphic function 
on S, and hence is a rational function of c = /(z). □ 

Theorem 1.17 Let X be a compact Riemann surface. There exists a nonconstant mero- 
morphic function f on X, and the set of such functions forms a finitely generated field 
^A(X) of transcendence degree 1 over C. 

The first statement is the fundamental existence theorem (Gunning 1966, pi 07). Its 
proof is not easy (it is implied by the Riemann-Roch Theorem), but for all the Riemann 
surfaces in this course, we shall be able to write down a nonconstant meromorphic function. 

It is obvious that the meromorphic functions on X form a field A4(X). Let / be a 
nonconstant such function, and let n be its valence. Then (1.16) shows that every other 
function is algebraic over C(/), and in fact satisfies a polynomial of degree < n. Therefore 
A4(X) has degree < « over C(/), because if it had degree > n then it would contain a 
subfield L of finite degree n' > n over C(/), and the Primitive Element Theorem (FT 5.1) 
tells us that then L = C(/)(g) for some g whose minimum polynomial has degree n'. 

Example 1.18 Let S be the Riemann sphere. For every meromorphic function / on 5 
with valence 1, M(S) = C(/). 



1. Preliminaries 



21 



Remark 1.19 The meromorphic functions on a compact complex manifold X of dimension 
m> I again form a field that is finitely generated over C, but its transcendence degree may be 
< m. For example, there are compact complex manifolds of dimension 2 with no nonconstant 
meromorphic functions. 

Riemann-Roch Theorem. 

The Riemann-Roch theorem describes how many functions there are on a compact Riemann 
surface with given poles and zeros. 

Let X be a compact Riemann surface. The group of divisors Div(X) on X is the free 
(additive) abelian group generated by the points on X; thus an element of Dvv(X) is a finite 
sum Pj, Hi € Z. A divisor D — Y^ Ui Pi is positive (or effective) if every > 0; we 
then write D > 0. 

Let / be a nonzero meromorphic function on X. For a point P e X, let ordp (/) = m, 
—m, or according as / has a zero of order m at P, a pole of order m at P, or neither a 
pole nor a zero at P . The divisor of / is 



This is a fiiute sum because the zeros and poles of / form discrete sets, and we are assuming 
X to be compact. 

The map / div(/):7W(X)^ Div(X) is a homomorphism, and its image is called 
the group of principal divisors. Two divisors are said to be Unearly equivalent if their 
difference is principal. The degree of a divisor P,- is ^n,-. The map D deg(D) is a 
homomorphism Div(X) Z whose kernel contains the principal divisors. Thus it makes 
sense to speak of the degree of a linear equivalence class of divisors. 

It is possible to attach a divisor to a differential form co: let P e X, and let (Ui , zi) be a 
coordinate neighbourhood containing P ; the differential form co is described by a differential 
fidzi on Ui, and we set ordp(ft>) = ordp{fi). Then ord^((w) is independent of the choice 
of the coordinate neighbourhood {// (because the derivative of every transition function coij 
has no zeros or poles), and we define 



If CO is one nonzero differential form, then any other is of the form fco for some / € M.{X), 
and so the linear equivalence class of div(ft)) is independent of co; we write K for div((w), 
and t for its linear equivalence class. 
For a divisor D, let 



div(/) = ;^ord^(/).P. 




Again, this is a finite sum. Note that, for every meromorphic function /, 



div(/(w) = div(/) + div(ct>). 



L{D) = {/ € M{X) I div(/) + > 0} 



if this set is nonempty, and let L{D) = {0} if it is empty. Then L(D) is a vector space over 
C, and if Z)' = Z) + (g), then / /g-^ is an isomorphism L(D) L(D'). Thus the 
dimension t(D) of L(D) depends only on the linear equivalence class of D. 
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Theorem 1 .20 (Riemann-Roch) Let X be a compact Riemann surface. Then there is 
an integer g >0 such that, for every divisor D, 

UD) = degiD) + l-g + UK-D). (1) 

Proof. See Gunning 1962, §7, for a proof in the context of Riemann surfaces, and Fulton 
1969,^* Chapter 8, for a proof in the context of algebraic curves. One approach to proving it 
is to verify it first for the Riemann sphere S (see below), and then to regard Z as a finite 
covering of 5. □ 

Note that in the statement of the Riemann-Roch Theorem, we could replace the divisors 
with equivalence classes of divisors. 

Corollary 1.21 A canonical divisor K has degree 2g — 2, and 1{K) = g. 

Proof. Put Z) = in (1). The only functions with div(/) > are the constant functions, 
and so the equation becomes 1 = 0+1 — ^-|- i(K). Hence l(K) = g. Put D = K; then the 
equation becomes g = deg(K) + I — g + I, which gives deg(^) = 2g — 2. □ 

Let K = div(ft)). Then / /oj is an isomorphism from L{K) to the space of holomor- 
phic differential forms on X, which therefore has dimension g. 

The term in the Riemann-Roch formula that is difficult to evaluate is i(K — D). Thus it 
is useful to note that if deg(D) >2g-2, then L(K - D) = (because, for / e M{X}^, 
deg(D) >2g-2^ deg(div(/) A: - Z)) < 0, and so div(/) + K-D can't be a positive 
divisor). Hence: 

Corollary 1.22 Ifdeg(D) >2g-2, thene(D) = deg(D) + l-g. 

Example 1.23 Let X be the Riemann sphere, and let D = mPoo, where Poo is the "point 
at infinity" and m>0. Then L(D) is the space of meromorphic functions on C with at worst 
a pole of order m at infinity and no poles elsewhere. These functions are the polynomials of 
degree < m, and they form a vector space of dimension m -|- 1, in other words, 

€(Z)) = deg(D) + l, 

and so the Riemann-Roch theorem shows that g = 0. Consider the differential dz on C, and 
let z' = l/z. The dz = —1/ z'-^dz', and so dz extends to a meromorphic differential on X 
with a pole of order 2 at oo. Thus deg(div(co)) = —2, in agreement with the above formulas. 

Exercise 1 .24 Prove the Riemann-Roch theorem (1.20) for the Riemann sphere, 

(a) by using partial fractions, and 

(b) by using Example 1.23 and the fact that 1(D) = t(D') for linearly equivalent divisors. 



^Fulton, William. Algebraic curves. W. A. Benjamin, Inc., New York-Amsterdam, 1969; available at 
www .math . Isa.umich. edu/~wf ulton/CurveBook .pdf . 
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The genus of X 

Let X be a compact Riemann surface. It can be regarded as a topological space, and so we 
can define homology groups Ho{X,Q), //i(Z,Q), H2{X,Q). It is known that Hq and H2 
each have dimension 1, and H\ has dimension 2g. It is a theorem that this g is the same as 
that occurring in the Riemann-Roch theorem (see below). Hence g depends only on X as a 
topological space, and not on its complex structure. The Euler-Poincare characteristic of X 
is 

X{X) = dim Ho - dim Hi + dim //a = 2 - 2g. 

Since X is oriented, it can be triangulated. When one chooses a triangulation, then one finds 
(easily) that 

2-lg= V-E + F, 

where V is the number of vertices, E is the number of edges, and F is the number of faces. 

Example 1.25 The sphere can be triangulated by projecting out from the centre of a 
regular tetrahedron whose vertices are on the sphere. In this case V = A, E = 6, F = A, 
which gives g = as expected. 

Example 1.26 Consider the map z z^:D ^ D, e > 1, where D is the unit open disk. 
This map is exactly e : 1 except at the origin, which is a ramification point of order e. 
Consider the differential dz' on D. The map is z' = w(z) = z^, and so the inverse image of 
the differential dz' is dz' = dw(z) = ez^^^dz. Thus w*{dz') has a zero of order e — 1 at 0. 

Theorem 1 .27 (Riemann-Hurwitz Formula) Let f:Y^Xbea holomorphic map- 
ping of compact Riemann surfaces that ism :\ (except over finitely many points). For each 
point P of X, let ep be the multiplicity of P in the fibre of f ; then 

2g(Y) - 2 = m(2g(X) - 2) + ^(ep - 1). 

Proof. Choose a differential co on X such that o) has no pole or zero at a ramification point 
of X. Then f*(jo has a pole and a zero above each pole and zero of co (of the same order 
as that of co) ; in addition it has a zero of order e — I at each ramification point in Y (cf . the 
above example 1.26). Thus 

deg(/*C(j) — mdeg(co) + ^(ep - 1), 

and we can apply (1.21). □ 

Remark 1.28 One can also prove this formula topologically. Triangulate X in such a way 
that each ramification point is a vertex for the triangulation, and pull the triangulation back 
to Y. There are the following formulas for the numbers of faces, edges, and vertices for the 
triangulations of Y and X : 

F(Y) = m-FiX), E(Y) = m-E(X), 
V(Y) = m ■ V{X) - Y^iep - 1). 

Thus 

2 - 2g{Y) = m(2-2g(X)) - J^iep - 1), 
in agreement with (1.27). 
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We have verified that the two notions of genus agree for the Riemann sphere S (they 
both give 0). But for any Riemann surface X, there is a nonconstant function f:X^S (by 
1.17) and we have just observed that the formulas relating the genus of X to that of S is the 
same for the two notions of genus, and so we have shown that the two notions give the same 
value for X. 

Riemann surfaces as algebraic curves. 

Let X be a compact Riemann surface. Then (see 1.17) M.{X) is a finitely generated field 
of transcendence degree 1 over C, and so there exist meromorphic functions / and g onX 
such that M.{X) = C( f, g). There is a nonzero irreducible polynomial 0(X, Y) such that 

Hf,g) = o. 

Then z i-^ ( f(z),g(z)):X — ^ maps an open subset of X onto an open subset of the 
algebraic curve defined by the equation: 

0(X,Y) = 0. 

Unfortunately, this algebraic curve will in general have singularities. A better approach 
is the following. Assume initially that the Riemann surface X has genus > 2 and is not 
hyperelliptic, and choose a basis coo, ■■■,co„, (n = g — I) for the space of holomorphic 
differential forms on X. For P e X,we can represent each coi in the form ft ■ dz in some 
neighbourhood of P . After possibly replacing each coj with fu>i,f a meromorphic function 
defined near P , the ft 's will all be defined at P , and at least one will be nonzero at P . Thus 
(/o(P) : . . . : fn(P)) is a well-defined point of P"(C), independent of the choice of /.It is 
known that the map (p 

P^(MP):...:MP)):X^F"(C) 

is a homeomorphism of X onto a closed subset of P"(C), and that there is a finite set of 
homogeneous polynomials in « + 1 variables whose zero set is precisely (p(X). Moreover, 
the image is a nonsingular curve in P"(C) (Griffiths 1989,^^ IV 3). If X has genus < 2, or 
is hyperelliptic, a modification of this method again realizes X as a nonsingular algebraic 
curve in F" for some n . 

Every nonsingular algebraic curve is obtained from a complete nonsingular algebraic 
curve by removing a finite set of points. It follows that a Riemann surface arises from an 
algebraic curve if and only if it is obtained from a compact Riemann surface by removing 
a finite set of points. On such a Riemann surface, every bounded holomorphic function 
extends to a holomorphic function on the compact surface, and so is constant. As |^ is a 
nonconstant bounded holomorphic function on H, we see that H is not the Riemann surface 
attached to an algebraic curve. 



Griffiths, Phillip A., Introduction to algebraic curves. AMS 1989. 
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2 Elliptic Modular Curves as Riemann Surfaces 

In this section, we define the Riemann surfaces Y(N) = r(N)\M and their natural com- 
pactifications, X(N). Recall that IHI is the complex upper half plane 

m={zeC\ > 0}. 
The upper-half plane as a quotient of SL2(M) 

We saw in the Introduction that there is an action of SL2(M) on H as follows: 

SL2(M)xIHI^M, (a,z)^a(z) = ^^i^. « = ! ^ I ■ 

cz + d \c a J 

Because ^(az) = S(z)/|cz + Jp, 3(z) > ^ 3(az) > 0. When we give SL2(M) and M 
their natural topologies, this action is continuous. 

The special orthogonal group (or "circle group") is defined to be 

cos 6 sin 
- sin 9 cos f 



S02(M) 



Note that S02(M) is a closed subgroup of SL2(M), and so SL2(M)/S02(M) is a Hausdorff 
topological space (by 1.1). 

Proposition 2.1 (a) The group SL2(M) acts transitively on H, i.e., for every pair of 
elements z,z' eM, there exists ana e SL2(M) such that a z — z' . 

(b) The action of SL2 (M) on H induces an isomorphism 

SL2(M)/{±/} Aut(]HI) (biholomorphic automorphisms of H) 

(c) The stabilizer of i is SO2 (M) . 

(d) The map 

SL2(M)/S02(i?)^H, a-SO2(^)^a(0 
is a homeomorphism. 

Proof, (a) It suffices to show that, for every z e H, there exists an element of SL2(M) 
mapping z to z. Write z = x + iy; then {^) ^(01)^ SL2(M) and maps / to z. 

(b) If ^ ) -z = z then cz^ + {d -a)z-b = Q. If this is true for all z e IHI (any three 
z's would do), then the polynomial must have zero coefficients, and so c = 0, J = a, and 
b = 0. Thus ("j) = (oa)' ™^ ^^^^ determinant 1 if and only if a = ±1. Thus only / 
and — / act trivially on H. 

Let Y be an automorphism of IH. We know from (a) that there is an a 6 SL2 (M) such 
that a{i) = y(/). After replacing y with o y, we can assume that y{i) = i. Recall that 
the map p: IH ^ D, z |^ is an isomorphism from EI onto the open unit disk, and it maps 
i to 0. Use p to transfer y into an automorphism y' of D fixing 0. Lemma 2.2 below tells us 
that there is a e R such that p o y o p^^ (z) = e^^' ■ z for all z, and Exercise 2.3(c) shows 
that y(z) = (4-^ f^^l)-z. Thus y e S02(M) C SL2(«). 

(c) We have already proved this, but it is easy to give a direct proof. We have 

ai+b . . , ,. , , 

= ; <^=^ ai + b = —c + di <^=^ a = d, b = —c. 

ci + d 

Therefore the matrix is of the form with + b'^ = 1, and so is in S02(M). 

(d) This is a consequence of the general result (1.2). □ 
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Lemma 2.2 The automorphisms of D fixing are the maps of the form z Az, |A| = 1. 

Proof. This is an easy consequence of the Schwarz Lemma (Cartan 1963, III 3), which 
says the following: 

Let /(z) be a holomorphic function on the disk |z| < 1 and suppose that 

/(0) = 0, |/(z)|<lfor|z|<l. 

Then 

(i) |/(z)|<|z|for|z|<l; 

(ii) if |/(zo)| = l^ol for some zq ^ 0, then there is a A such that /(z) = Az 
(and |A| = 1). 

Let Y be an automorphism of D fixing 0. When we apply (i) to y and y~\ we find that 
|y(z)| — \z\ for all z in the disk, and so we can apply (ii) to find that / is of the required 
form. □ 

Exercise 2.3 Let V^: x ^ C be the Hermitian form 



/zA ^Wl 

\Z2 J ' \W2 



and let SU(1, 1) (special unitary group) be the subgroup of elements a e SL2( C) such that 
\[r(a(z),a(w)) = \[r(z,w). 

(a) Show that 

V u. 



SU(1,1) 



M, f e C, ImP — luP = 1 



\v uj 



z 



(b) Define an action of SU(1, 1) on the unit disk as follows: 

uz + v 
vz + u 

Show that this defines an isomorphism SU(1, l)/{±/} Aut(D). 

(c) Show that, under the standard isomorphism p:M ^ D, the action of the element 
i-Ze cose) °f SL2(M) on H corresponds to the action of (^""'^ J^g^ on D. 

Quotients of EI 

Let F be a group acting continuously on a topological space X. If is Hausdorff, then 
the orbits are closed, but this condition is not sufficient to ensure that the quotient space is 
Hausdorff. 

Proposition 2.4 Let G be a locally compact group acting on a topological space X 
such that for one (hence every) point xq e X, the stabilizer K of xq in G is compact and 
gK gxo'.G/ K X is a homeomorphism. The following conditions on a subgroup F of 
G are equivalent:^ 



^In an unfortunate terminology (Lee, Introduction to Topological Manifolds, p268), these conditions are 
also equivalent to: (c) F acts discontinuously on X;{d) F acts properly discontinuously on X. A continuous 
action F xX ^ X is said to be discontinuous if for every x G X and infinite sequence (y, ) of distinct elements 
of F, the set {yix} has no cluster point; it is said to be properly discontinuous if, for every pair of points x and 
y of X, there exist neighbourhoods Ux and Uy of x and y such that the set {y € F \ yUx <~\Uy ^ &} is finite. 
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(a) for all compact subsets A and B of X , {y e r\ yAH B ^ 0} is finite; 

(b) r is a discrete subgroup of G. 

Proof, (b) =J> (a) (This is the only implication we shall use.) Write p for the map, 
g gXQ.G X. Let A he a. compact subset of X.l claim that p~^(A) is compact. Write 
G = [jVj where the Vj are open with compact closures Vi. Then A c[J p(Vi), and in fact 
we need only finitely many /?(l^)'s to cover A. Then p^^{A) c[jViK c[jViK (finite 
union), and each ViK is compact (it is the image of Vi x K under the multiplication map 
G xG ^ G). Thus p~^(A) is a closed subset of a compact set, and so is compact. Similarly, 
p~^(B) is compact. 

Suppose yAHB j^Qmdy e r. Then y(p~'^A)n p~^ B ^ 0, and so y ern(p^'^B)- 
{p^^A)~^. But this last set is the intersection of a discrete set with a compact set and so is 
finite. 

For (a) =^ (b), let V be any neighbourhood of 1 in G whose closure V is compact. For 
any xeX, FnKc{yeF| yx e V ■ x}, which is finite, because both {x} and V ■ x are 
compact. Thus F fl K is discrete, which shows that e is an isolated point of F . □ 

The next result makes statement (a) more precise. 

Proposition 2.5 Let G, K, X be as in (2.4), and let T be a discrete subgroup of G. 

(a) For every x e X , {g e F \ gx = x} is finite. 

(b) For any x e X, there is a neighbourhood U of x with the following property: if y e F 
and U n yU ^ 0, then yx = x. 

(c) For any points x andy e X that are not in the same F -orbit, there exist neighbourhoods 
Uofx and V of y such thatyU PiV = for all y e F. 

Proof, (a) We saw in the proof of (2.4) that (compact) is compact, where p(g) = gx. 
Therefore p~^ (x) is compact, and the set we are interested in is (x) fl F. 

(b) Let K be a compact neighbourhood of x. Because (2.4a) holds, there is a finite set 
{yi , Yn] of elements of F such that F fl y,- K 7^ 0. Let y\ , ...,ys be the y, 's fixing x. For 
each i > s, choose disjoint neighbourhoods Vi of x and Wi of yiX, and put 



For i > s, yi U cWi which is disjoint from Vi, which contains U. 

(c) Choose compact neighbourhoods ^ of x and B of y, and let yi , y„ be the elements 
of F such that y,-^ fl S 7^ 0. We know y,x 7^ y, and so we can find disjoint neighbourhoods 
Ui and Vi of yiX and y. Take 



Corollary 2.6 Under the hypotheses of (2.5), the space F\X is Hausdorff. 

Proof. Let x and y be points of X not in the same F -orbit, and choose neighbourhoods U 
and V as in (2.5). Then the images of U and V in F\X are disjoint neighbourhoods of Fx 




u = An y^'Ui n...ny-'u„, F = s n Fl n ... n F„. 



□ 



and Fy . 



□ 



A group F is said to act freely on a set X if Stab(x) = e for all x e X. 
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Proposition 2.7 Let r be a discrete subgroup of SL2(M) such that F (or r/{±I} if 
—I e F) acts freely on H. Then there is a unique complex structure on F\M with the 
following property: a function f on an open subset U of F\M is holomorphic if and only if 
f o p is holomorphic. 

Proof. The uniqueness follows from the fact (see 1.8) that the sheaf of holomorphic 
functions on a Riemann surface determines the complex structure. Let z e F\M, and choose 
an X e p^^(z). According to (2.5b), there is a neighbourhood U of x such that yU is 
disjoint from U for all y 6 y e. The map p\U:U p(U) is a homeomorphism, and 
we take all pairs of the form (p(U),{p\U)~^) to be coordinate neighbourhoods. It is easy 
to check that they are all compatible, and that the holomorphic functions are as described. 
(Alternatively, one can define 0(U) as in the statement of the proposition, and verify that 
U 0{L() is a sheaf of C-algebras satisfying (1.9(*).) □ 

Unfortunately SL2(Z)/{±/} doesn't act freely. 
Discrete subgroups of SL2(M) 

To check that a subgroup F of SL2(M) is discrete, it suffices to check that e is isolated in F. 
A discrete subgroup of PSL2(M) is called a Fuchsian group. Discrete subgroups of SL2(M) 
abound, but those of interest to number theorists are rather special. 

Congruence subgroups of the elliptic modular group 

Clearly SL2(Z) is discrete, and a fortiori, F(N) is discrete. A congruence subgroup of 
SL2(Z) is a subgroup containing F(N) for some A^. For example, 

c = modA^)| 

is a congruence subgroup of SL2(Z). By definition, the sequence 

1 F(N) SL2(Z) SL2(Z/A^Z) 

is exact.^ I claim that the map SL2(Z) SL2{1^/ NIj) is surjective. To prove this, we have 
to show that if ^ e M2(Z) and det(^) = 1 mod N, then there is a 5 e M2(Z) such that 
B = A mod and det(S) = 1. Let ^ = ^); the condition on A is that 

ad — be — Nm = 1 

for some m elj. Hence gcd(c,d, N) = I, and we can find an integer n such that gcd(c, d + 
nN) = I (apply the Chinese Remainder Theorem to find an n such that d +nN = \ mod p 
for every prime p dividing c but not dividing and n = mod p for every prime p 
dividing both c and A'^). We can replace d with d + nN , and so assume that gcd(c, J) = 1. 
Consider the matrix 

g^(a + eN b + fN 
\ c d 

^For a commutative ring A, MjiA) is the ring of 2 x 2 matrices with entries in A, and SL2(^) is the group 
of 2 X 2 matrices with entries in A having determinant 1 . 
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for some integers e, f . Its determinant is ad — be + N(ed — fc) — I + (m + ed — fc)N. 
Since gcd(c, c?) = 1, there exist integers e, f such that m = fc + ed, and with this choice, 
B is the required matrix. 

Note that the surjectivity of SL2(Z) SL2(Z/NZ) implies that SL2(Z) is dense in 
SL2(Z), where Z is the completion of Z for the topology of subgroups of finite index (hence 
Z~Um^Z/AfZ~n^£)- 

Discrete groups coming from quaternion algebras. 

For any rational numbers a,b, let B = B^^b be the Q-algebra with basis {1,/,7,A:} and 
multiplication given by 

= a , = b , ij = k = —ji. 

Then S (g) M is an algebra over M with the same basis and multiplication table, and it is 
isomorphic either to M2(M) or the usual (Hamiltonian) quaternion algebra — we suppose the 
former. 

For a = w + X i + yj + zk e B , let a = w — X i — yj —zk, and define 

Nm(a) = aa = w^ — ax^ — by^ + abz'^ e Q. 

Under the isomorphism B (gi M ^ M2(M), the norm corresponds to the determinant, and so 
the isomorphism induces an isomorphism 

{a e 5 (8>R I Nm(a) = 1} ^ SL2(M). 

An order in B is a subring O that is finitely generated over Z (hence free of rank 4). Define 

ra,b = {aeO\ Nm(a) = 1}. 

Under the above isomorphism this is mapped to a discrete subgroup of SL2(M), and we can 
define congruence subgroups of P^^b for SL2(Z). 

For a suitable choice of (a,b), B = M2(Q) (ring of 2 x 2 matrices with coefficients in 
Q), and if we choose O to be M2(Z), then we recover the elliptic modular groups. 

If B is not isomorphic to M2(Q), then the families of discrete groups that we get are 
quite different from the congruence subgroups of SL2(Z): they have the property that r\M 
is compact. 

There are infinitely many nonisomorphic quaternion algebras over Q, and so the congru- 
ence subgroups of SL2(Z) form just one among an infinite sequence of families of discrete 
subgroups of SL2(M). 

[These groups were found by Poincare in the 1880's, but he regarded them as automor- 
phism groups of the quadratic forms <Pa,b — —aX-^ — bY-^ + abZ-^. For a description of 
how he found them, see p52, of his book. Science and Method.] 

Exercise 2.8 Two subgroups F and F' of a group are said to be commensurable if 
r n r' is of finite index in both F and F' . 

(a) Commensurability is an equivalence relation (only transitivity is nonobvious). 

(b) If F and F' are commensurable subgroups of a topological group G, and F is discrete, 
then so also is F' . 

(c) If F and F' are commensurable subgroups of SL2(M) and F\EI is compact, so also 

is r'\H. 
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Arithmetic subgroups of the elliptic modular group 

A subgroup of SL2(Q) is arithmetic if it is commensurable with SL2(Z). For example, 
every subgroup of finite index in SL2(Z), hence every congruence subgroup, is arithmetic. 
The congruence subgroups are sparse among the arithmetic subgroups: if we let N(m) be 
the number of congruence subgroups of SL2(Z) of index < m, and let N'(m) be the number 
of subgroups of index < m, then N{m)/N'(m) ^ as m ^ oo. 

Remark 2.9 This course will be concerned with quotients of H by congruence groups in 
the elliptic modular group SL2(Z), although the congruence groups arising from quaternion 
algebras are of equal interest to number theorists. There is some tantalizing evidence that 
modular forms relative to other arithmetic groups may also have interesting arithmetic 
properties, but we shall ignore this. 

There are many nonarithmetic discrete subgroups of SL2(M). The ones of most interest 
(to analysts) are those of the "first kind" — they are "large" in the sense that r'\SL2(M) 
(hence FyH) has finite volume relative to a Haar measure. 

Among matrix groups, SL2 is anomalous in having so many discrete subgroups. For 
other groups there is a wonderful theorem of Margulis (for which he was awarded the Fields 
medal), which says that, under some mild hypotheses (which exclude SL2), every discrete 
subgroup r of G(M) such that F\G(M) has finite measure is arithmetic. For many groups 
one even knows that all arithmetic subgroups are congruence (see Prasad 1991"^). 

Classification of linear fractional transformations 

The group SL2(C) acts on C^, and hence on the set (C) of lines through the origin in C'^. 
When we identify a line with its slope, (C) becomes identified with C U {00}, and we get 
an action of GL2(C) on C U {00} : 



These mappings are called the linear fractional transformations of P (C) = C U {00}. They 



the identity transformation. By the theory of Jordan canonical forms, every nonscalar a is 
conjugate to a matrix of the following type, 



according as it has repeated eigenvalues or distinct eigenvalues. In the first case, a is 
conjugate to a transformation z z + and in the second to z cz, c \. In case (i), 
a is called parabolic, and case (ii), it is called elliptic if |c| = 1, hyperbolic if c is real and 
positive, and loxodromic otherwise. 




map circles and lines in C into circles or lines in C. The scalar matrices 





^Prasad, Gopal. Semi-simple groups and arithmetic subgroups. Proceedings of the International Congress 
of Mathematicians, Vol. I, II (Kyoto, 1990), 821-832, Math. Soc. Japan, Tokyo, I99I. 



2. Elliptic Modular Curves as Riemann Surfaces 



31 



When a e SL2(C), the four cases can be distinguished by the trace of a : 

a is parabolic <^=^ Tr(a) = ±2; 

a is elliptic <^=^ Tr(a) is real and | Tr(Q;)| < 2; 

a is hyperbolic <^=^ Tr(a) is real and | Tr(a)| > 2; 

a is loxodromic <^=^ Tr(a) is not real. 

We now investigate the elements of these types in SL2(M). 



Parabolic transformations Suppose a e SL2(M), a ^ ±7, is parabolic. Then it has exactly 
one eigenvector, and that eigenvector is real. Suppose that the eigenvector is ( y- ) ; if / y^O, 
then a has a fixed point in R; if / = 0, then oo is a fixed point (the transformation is then of 
the form z z + c). Thus a has exactly one fixed point in M U {oo}. 



Elliptic transformations Suppose a e SL2(M), a ^ ±7, is elliptic. Its characteristic poly- 
nomial is + bX + 1 with \b\ < 2; hence A = — A <Q, and so a has two complex 
conjugate eigenvectors. Thus a has exactly one fixed point z in H and a second fixed point, 
namely, z, in the lower half plane. 



Hyperbolic transformations Suppose a e SL2(M) and a is hyperbolic. Its characteristic 

polynomial is X'^ + bX + 1 with \b\ > 2; hence A = b-^ — 4 > 0, and so a has two distinct 
real eigenvectors. Thus a has two distinct fixed points in M U {oo}. 

Let F be a discrete subgroup of SL2(M). A point z e HI is called an elliptic point if it is 
the fixed point of an elliptic element y of 7^; a point 5 e M U {oo} is called a cusp if there 
exists a parabolic element y e F with s as its fixed point. 

Proposition 2.10 Ifz is an elliptic point of F, then {y e F \ yz — z} is a finite cyclic 
group. 

Proof. There exists an a e SL2(M) such that a(i) = z, and y i-^ a~^ya defines an iso- 
morphism 

{y€F\ yz = z} S02(M)n(a"ira). 
This last group is finite because it is both compact and discrete. There are isomorphisms 

M/Z ~ {zeC||z| = l} ~ S02(M) 

\sia9 cos9 ' 

and so 

S02(M) tors - (M/Z)to,, = Q/Z. 

Every finite subgroup of Q/Z is cyclic (each is of the form n 
common denominator of the elements of the group). 

Remark 2.11 Let F(l) be the full modular group SL2(Z). I claim the cusps of F(l) are 
exactly the points of Q U {oo}, and each is 7^(l)-equivalent to oo. Certainly oo is the fixed 
point of the parabolic matrix r = (q } ). Suppose m/n e Q; we can assume m and n to be 
relatively prime, and so there are integers r and s such that rm — sn = 1; let y = 
then y(oo) = m/n, and m/n is fixed by the parabolic element yTy~^. Conversely, every 



Z/Z where n is the least 

□ 
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parabohc element a of F(l) is conjugate to ±T, say a = ±YTy~ , y e GL2(Q). The point 
fixed by a is yoo, which belongs to Q U {oo}. 

We now find the elliptic points of r(l). Let y be an elliptic element in r(l). The 
characteristic polynomial of y is of degree 2, and its roots are roots of 1 (because y has finite 
order). The only roots of 1 lying in a quadratic field have order dividing 4 or 6. From this, it 
easy to see that every elliptic point of EI relative to F(l) is /^(l)-equivalent to exactly one 
of / or p = (1 +/a/3)/2. (See also 2.12 below.) 

Now let F be a subgroup of F(l) of finite index. The cusps of F are the cusps of 7^(1), 
namely, the elements ofQU{oo} = P^ (Q), but in general they will fall into more than one 
F -orbit. Every elliptic point of F is an elliptic point of 7^(1); conversely, an elliptic point of 
F(l) is an elliptic point of F if an only if it is fixed by an element of F other than ±7. 

Fundamental domains 

Let 7^ be a discrete subgroup of SLaCM). A fundamental domain for F is a connected open 
subset D of H such that no two points of D are equivalent under 7^ and M = [JyD, where 
D is the closure of D . These conditions are equivalent respectively, to the statements: the 
map D F\E. is injective; the map D F\M is surjective. Every 7^ has a fundamental 
domain, but we shall prove this only for the subgroups of finite index in 7^(1). 
Let S = {1 -1) and 7 = (i }). Thus 

Sz = ^, Tz = z + \ 

= 1 mod ± 7, (Sry = 1 mod ± 7. 

To apply 5 to a z with \z\ = 1, first reflect in the x-axis, and then reflect through the origin 
(because S(e'^) = -(e"'^)). 

Theorem 2.12 LetT) = {z eH | |z| > 1, |iH(z)| < 1/2}. 

(a) D is a fundamental domain for F(l) = SL2(Z); moreover, 
two elements z and z' of D are equivalent under F{\) if and 
only if 

i) SR(z) = ±1/2 andz' = z± 1, (then z' = Tz or z = 
Tz'), or 

ii) |z| = 1 and z' = — 1/z = 5z. 

(b) Let z e D; if the stabilizer ofzj^{±F}, then 

i) z = i, and Stab{i) = {S), wiiicTi has order 2 in 
F(l)/{±I}),or 

ii) z = p = exp(2ni/6), and Stab(p) = (TS), which has 
order3inF(l)/{±I}),or 

iii) z — p^, and Stab(p^) = {ST), which has order 3 in 

r(i)/{±i}). 

(c) The group 7^(l)/{ib7}is generated by S andT . 

Proof. Let F' be the subgroup of 7^(1) generated by 5* and T . We shall show that 
F' -D = m. Recall that, if y = (« then 3(yz) = 3(z)/|cz + d\'^ . Fix a z £ M. Lemma 
2.13 below implies that there exists ay = ("^)6F' such that | cz + 1 is a minimum. Then 
S(yz) is a maximum among elements in the orbit of z. 
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For some n, z' = r"(yz) will have 

-l/2<9ft(z')< 1/2. 

I claim that |z'| > 1. If not, then 

3(5z') = 3(-l/z') = S = ^ > = ^(5^^)' 

which contradicts our choice of yz. We have shown that F' ■ D — W. 

Suppose z,z' & D are F-conjugate. Then either S(z) > S(z') or S(z) < S(z'), and we 
shall assume the latter. Suppose z' = yz with y = and let z = x + iy. Then our 

assumption impUes that 

(cx + i/)^ + (cj)^ = |cz + (/p < 1. 

This is impossible if c > 2 (because > 3/2), and so we need only consider the cases 
c = 0,l,-l. 

c = 0: Then <i = ±1, y = it (J and y is translation by h. Because z and yz e Z), 
this implies that 6 = ±1, and we are in case (a(i)). 

c = 1: As |z + J| < 1 we must have <i = 0, unless z = p = ^ + / in which case 
(i = or —1, or z = p^, in which case <i = or 1. If J = 0, then y = ± "^^), and 
yz = a — J. If a = 0, then we are in case (a(ii)). If a 7^ 0, then a = 1 and z = p^,oxa = —1 
and z = p. 

c = — 1: This case can be treated similarly (or simply change the signs of a,b,c,d m 
the last case). 

This completes the proof of (a) and (b) of the theorem. 

We now prove (c). Let y e P. Choose a point zq e D. Because F' -D = H, there 

is an element y' € F' and a point z e D such that y'z = yzo e D. Then zq is F(l)- 
equivalent to (y'~^y)zo e D; because zq € D, part (a) shows that zq = (y'~ V)-^o- Hence 
y'~V € Stab(zo) n F{1) = {±1}, and so y' and y are equal in r(l)/{±l}. 

Lemma 2.13 For a fixed z e H and N eN, there are only finitely many pairs of integers 
(c,d) such that 

\cz + d\ < N. 

Proof. Write z = x + iy. If (c,d) is such a pair, then 

|cz + J p = (cx + df + c^y^ , 

so that 

c^y^ < (cx + (^)^ + c^y^ < N. 

As z e H, J > 0, and so |c| < N/y, which imphes that there are only finitely many possibil- 
ities for c. For any such c, the equation 

(cx + df + c^y^<N □ 
shows that there are only finitely many possible values ofd. □ 
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Remark 2.14 We showed that the group r{l)/{±I} has generators S and T with relations 
= 1 and (ST)^ = 1. One can show that this is a full set of relations, and that r(l)/{±I} 

is the free product of the cyclic group of order 2 generated by S and the cyclic group of 

order 3 generated by ST. 

Most finite simple groups of Lie type are generated by an element of order 2 and an 

element of order 3, and all but three of the sporadic simple groups are. The simple groups 

with such generators are quotients of F(l) by an arithmetic subgroup, most of which are 

not congruence subgroups (because the only simple quotients of SL2(Z/A'^Z) are the groups 

PSL2(Z/ plj) with p a prime dividing A^). 

Aside 2.15 Our computation of the fundamental domain has applications for quadratic forms and 
sphere packings. 

Consider a binary quadratic form: 

q{x,y) = ax^ + bxy + cy^, a,b,c 

Assume q is definite, i.e., its discriminant A = — 4ac < 0. Two forms q and q' are equivalent if 
there is a matrix A e SL2(Z) taking q into q' by the change of variables. 

In other words, the forms 

qix,y) = (x,y)-Q- (^J^ , q'ix,y) = (x, y)-Q'- 

are equivalent if Q = A'^ ■ Q' ■ A. 

Every definite binary quadratic form can be written q{x. y) = a(x — 0}y){x — coy) with &) e H. 
The association ^ ^ « is a one-to-one correspondence between the definite binary quadratic forms 
with a fixed discriminant A and the points of H. Moreover, two forms are equivalent if and only if the 
points lie in the same SL2(Z)-orbit. A definite binary quadratic form is said to be reduced if <x> is in 

{z e H I -i < iH(z) < 1 and |z| > 1, or |z| = 1 and - i < m(z) < 0}. 

More explicitly, q(x,y) = ax^ + bxy + cy^ is reduced if and only if either 

—a<b<a<coTO<b<a = c. 

Theorem 2.12 implies: 

Every definite binary quadratic form is equivalent to a reduced form; two reduced forms 
are equivalent if and only if they are equal. 

We say that a quadratic form is integral if it has integral coefficients. There are only finitely 
many equivalence classes of integral definite binary quadratic forms with a given discriminant. Each 
equivalence class contains exactly one reduced form ax^ + bxy + cy^. Since 

4(2^ < 4ac = b^-A<a^-A 

we see that there are only finitely many values of a for a fixed A. Since \b\ <a, the same is true of b, 
and for each pair (a,b) there is at most one integer c such that b'^ — Aac = A. For more details, see 
Chapter 1 of W. LeVeque, Topics in Number Theory, II, Addison- Wesley, 1956. 

We can apply this to lattice sphere packings in M^. Such a packing is determined by the lattice of 
centres of the spheres (here disks). The object, of course, is to make the packing as dense as possible. 
To a lattice A in and a choice of a basis {/i , fj} for A, we attach the quadratic form 



q(xi,X2) = 11/1X1-^/2X2 
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The problem of finding dense sphere packings then becomes that of finding quadratic forms q with 

def min{^(;t) | x e Z^, x ^ 0} 
ydisc(^) 

as large as possible. Changing the choice of basis for A corresponds to acting on q with an element 
of SL2(Z), and so we can confine our attention to reduced quadratic forms. It is then easy to show 
that the quadratic form with y(^) minimum is that corresponding to p. The corresponding lattice has 

basis (q) and ^ (just as you would expect), and the quadratic form is 4(x^ + xy + j^). 

Fundamental domains for congruence subgroups 

First we have the following general result. 

Proposition 2.16 Let F bea discrete subgroup of SL2(M), and let D be a fundamental 
domain for F. Let F' be a subgroup of F of finite index, and choose elements Yi,...,Ym in 
F such that 

F = F'yi U . . .U F'fm (disjoint union) 

def 

where a bar denotes the image in Aut(]HI). Then D' = [J y,- D is a fundamental domain for 
F' (possibly nonconnected). 

Proof. Let z eM. Then z = yz' for some z' e D, y e F , and y = ±y'yi for some y' e F'. 
Thusz = y'yiz' e F'-(YiD). 

If yD' r\ D' ^ 0, then it would contain a transform of O. But then yyiD = yj D for 
some / 7^ j , which would imply that yy,- = ±yy, and this is a contradiction. □ 

Proposition 2.17 It is possible to choose the y, so that the closure of D' is connected; 
the interior of the closure of D' is then a connected fundamental domain for F. 

Proof. Omitted. □ 

Remark 2.18 Once one has obtained a fundamental domain for F, as in (2.16), it is 
possible to read off a system of generators and relations for F. 

In a future version, there will be many diagrams of fundamental domains. In the 
meantime, the reader can look at H. Verrill's fundamental domain drawer at http : //www . 
math . Isu . edu/~verrill/ 

Defining complex structures on quotients 

Before defining H* and the complex structure on the quotient F\]HI* we discuss two simple 
examples. 

Example 2.19 Let D be the open unit disk, and let Z\ be a finite group acting on D and 
fixing 0. The Schwarz lemma (see p26) implies that Aut(Z),0) = {z e C \ \z\ = 1} M/Z, 
and it follows that A is a finite cyclic group. Let z be its generator and suppose 

that = I. Then z'" is invariant under A, and so defines a function on A\D. It is a 
homeomorphism from A\D onto D, and therefore defines a complex structure on A\D. 

Let p be the quotient map D A\D. The map / f o p is a. bijection from the 
holomorphic functions onU C A\D to the holomorphic functions ofz'" on p~^(U) C D ; 
but these are precisely the holomorphic functions on p~^(U) invariant under the action of 
A. 
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Example 2.20 Let X = {z eC \ S(z) > c} (some c). Fix an integer h, and let « e Z act 
on X as z z + nh. Add a point "oo" and define a topology on X* = X U {00} as follows: 
a fundamental system of neighbourhoods of a point in X is as before; a fundamental system 
of neighbourhoods for 00 is formed of sets of the form {z e C \ ^(z) > N}. We can extend 
the action of Z on X to a continuous action on X* by requiring 00 + «/? = 00 for all n € Z. 
Consider the quotient space r\X*. The function 



q(z) 



z = 00, 



is a homeomorphism r\X* D from r\X* onto the open disk of radius e ^jtc/Zj ^j^j 
centre 0. It therefore defines a complex structure on r\X*. 



The complex structure on r(l)\]HI* 

We first define the complex structure on F(1)\]H. Write p for the quotient map EI 
r(l)\H.Let P be a point of r(l)\]HI, and let Q be a point of H mapping to it. 

If Q is not an elliptic point, we can choose a neighbourhood U of Q such that p is a 
homeomorphism U p(U). We define (p(U), p~^) to he a coordinate neighbourhood of 
P. 

If Q is equivalent to / , we may as well take it to equal / . The map z 1-^ |^ defines an 
isomorphism of some open neighbourhood D of / stable under 5 onto an open disk D' with 
centre 0, and the action of 5 on D is transformed into the automorphism ct:z i-> — z of Z)' 
(because it fixes / and has order 2). Thus {S)\D is homeomorphic to {a)\D', and we give 
{S)\D the complex structure making this a bi-holomorphic isomorphism. More explicitly, 
is a holomorphic function defined in a neighbourhood of /, and 5 = ~q) maps it to 

—z~^ — i —l — iz —i+z z — i 
—z^^+i —l+iz —i — z z + i 

Thus z (fri) ^ holomorphic function defined in a neighbourhood of / which is in- 
variant under the action of 5 ; it therefore defines a holomorphic function in a neighbourhood 
of p{i), and we take this to be the coordinate function near p{i). 

The point Q = can be treated similarly. Apply a linear fractional transformation 
that maps Q to zero, and then take the cube of the map. Exphcitly, is fixed by 57", 
which has order 3 (as a transformation). The function z defines an isomorphism 

from a disk with centre onto a disk with centre 0, and ( ^~^ )^ is invariant under ST . 

It therefore defines a function on a neighbourhood of p(p^), and we take this to be the 
coordinate function near p{p^). 

The Riemann surface r'(l)\]HI we obtain is not compact — to compactify it, we need to 
add a point. The simplest way to do this is to add a point 00 to H, as in (2.20), and use 
the function q{z) = e.xp{2Ttiz) to map some neighbourhood U = {z e IHI | 3(z) > A'^} of 
00 onto an open disk V with centre 0. The function q is invariant under the action of the 
stabilizer of {T) of 00, and so defines a holomorphic function q: {T)\U V, which we 
take to be the coordinate function near p(oo). 

Alternatively, we can consider M* = H UF^ (Q), i.e., H* is the union of IH with the 
set of cusps for r(l). Each cusp other than*^ 00 is a rational point on the real axis, and 

^We sometimes denote coby ioo and imagine it to be at the end of the imaginary axis. 
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is of the form aoo for some a e r{l) (see 2.1 1). Give aoo the fundamental system of 
neighbourhoods for which a is a homeomorphism. Then r(l) acts continuously on H*, and 
we can consider the quotient space r(l)\H*. Clearly, r(l)\]H* = (r(\)\H) U {oo}, and 
we can endow it with the same complex structure as before. 

Proposition 2.21 The Riemann surface r(l)\M* is compact and of genus zero; it is 
therefore isomorphic to the Riemann sphere. 

Proof. It is compact because D U {oo} is compact. We sketch four proofs that it has genus 
0. First, by examining carefully how the points of D are identified, one can see that it 
must be homeomorphic to a sphere. Second, show that it is simply connected (loops can be 
contracted), and the Riemann sphere is the only simply connected compact Riemann surface 
(Uniformization Theorem 0.1). Third, triangulate it by taking p, i, and oo as the vertices of 
the obvious triangle, add a fourth vertex not on any side of the triangle, and join it to the 
first three vertices; then 2 — 2g = 4 — 6 + 4 = 2. Finally, there is a direct proof that there is a 
function j holomorphic on r\M and having a simple pole at oo — it is therefore of valence 
one, and so defines an isomorphism of F\]HI* onto the Riemann sphere. □ 

The complex structure on TXEI* 

Let F C F(l) of finite index. We can define a compact Riemann surface F\]HI* in much the 
same way as for r'(l). The complement of r\M in /^\]HI* is the set of equivalence classes 
of cusps for F.^ 

First r\M is given a complex structure in exactly the same way as in the case F = r(l). 
The point oo will always be a cusp (F must contain for some h, and is a parabolic 
element fixing oo). If h is the smallest power of T in then the function q = exp{2niz/ h) 
is a coordinate function near oo. Any other cusp for F is of the form aoo for o e F{i), and 
z q{(y~^ (z)) is a coordinate function near aoo. 

We write Y{F) = r\M and X{F) = F\W . We abbreviate Y{F{N)) to Y{N), X{F{N)) 
to X{N), Y{Fo{N)) to Yq{N), X{Fo{N)) to Zo(A^) and so on. 

The genus of X{r) 

We now compute the genus of X{F) by considering it as a covering of X{F{\)). According 



where m is the degree of the covering X{F) X{F{\)) and is the ramification index at 
the point P . The ramification points are the images of elliptic points on H* and the cusps. 

Theorem 2.22 Let F he a subgroup of F{1) of finite index, and let vi =the number of 
inequivalent elliptic points of order 2; vj, =the number of inequivalent elliptic points of 
order 3; v^o = the number of inequivalent cusps. Then the genus of X(F) is 



to (1.27) 




or 



g= l-,n + ^(ep-l)/2. 



g = 1 +,m/12- V2/4- V3/3- Voo/2. 



^Exercise: check that F\EI* is Hausdorff. 
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Proof. Let p be the quotient map H* r(l)\H*, and let (p be the map r\H* 
r{\)\H*. If Q is a point of M* and P' and P are its images in r\M* and r(l)\M* then 
the ramification indices multiply: 

e{Q/P) = e{Q/P')-e{P'/P). 

If g is a cusp, then this formula is not useful, d& e{Q/ P) = oo — e{Q/ P') (the map p is 
oo : 1 on every neighbourhood of oo). For 2 € H and not an elliptic point it tells us P' is 
not ramified. 

Suppose that P = p(i), so that Q is F(l)-equivalent to i. Then either e(Q/ P') = 2 
or e{P' / P) — 2. In the first case, Q is an elliptic point for F and P' is unramified over 
P; in the second, Q is not an elliptic point for F, and the ramification index of P' over P 
is 2. There are V2 points P' of the first type, and {m — V2)/2 points of the second. Hence 
J2(ep'-l) = {m-V2)/2. 

Suppose that P — p(p), so that Q is r(l)-equivalent to p. Then either e(Q/P') — 3 
or e(P'/P) = 3. In the first case, Q is an elliptic point for F and P' is unramified over 
P; in the second, Q is not an elliptic point for F, and the ramification index of P' over P 
is 3. There are vj, points P' of the first type, and (m — V3)/3 points of the second. Hence 
^(ep/-l) = 2(m-V3)/3. 

Suppose that P = pioo), so that Q is sl cusp for F. There are Voo points P' and 
J2 = fn; hence — 1 = m — Vcxj. 

We conclude: 





-1) 


= (m- 


■ V2) /2 (P ' lying over ^ (/ )) 




-1) 


= 2(m 


- V3)/3 (P' lying over (p(p)) 




-1) 


= (m- 


■ Voo) (P' lying over ^(00)). 



Therefore 

g = l-m + ^(ep-l)/2= l+m/12-V2/4-V3/3-Voo/2. ^ 

Example 2.23 Consider the principal congruence subgroup F(N). We have to compute 
the index of F(N) in F, i.e., the order of SL2(Z/A/^Z). One sees easily that: 

(a) GL2(Z/A^Z) «i nGL2(Z//jp Z) if AT = Y\P? (because Z/A^Z «i H^/Pp'^)- 

(b) The order of GL2(Fp) = (/7^ — l)(/7'^ — p) (because the top row of a matrix in 
Gh2(¥p) can be any nonzero element of and the second row can then be any 
element of k-^ not on the line spanned by the first row). 

(c) The kernel of GL2(Z/ /''^Z) GL2(F^) consists of all matrices of the form / + 
p{'' ^) with a,b,c,d e Z/p'^'^Z, and so the order of GL2(Z/;p''Z) is (p'-^)"^ ■ 

(/_1)(^2_^). 

(d) #GL2(Z/p'-Z) = (p(p'-)-#SL2iZ/p'-Z), where (pip') = p'"E)'' = {p-l) p'-^ . 
On putting these statements together, one finds that 

iF(i):F{N))^N'-Y\il-p-^). 

p\N 
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Write r(N) for the image of r(N) in r(l)/{±/}. Then 

(r(i) : riN)) = (r(i) : r(iV))/2, 

unless A'^ = 2, in which case it = 6. 

What are V2, v^, and Voo'? Assume > 1. Then r(N) has no elliptic points — the 
only torsion elements in r(l) are 5* = ~q), ST = (j ~}), (ST)-^, and their conjugates; 
none of these three elements is in r(N) for any A'^ > 1, and because r{N) is a normal 
subgroup, their conjugates can't be either. The number of inequivalent cusps is fii\f /N 
where (Xn = (^(1) : r(N) (see 2.24). We conclude that the genus of r(N)\M* is 

g(N)^l + fiN-iN-6)/l2N (N>1). 

For example, 

A^ = 23456 7 8 9 10 11 
/X = 6 12 24 60 72 168 192 324 360 660 
g = 1 3 5 10 13 26. 

Note that X(2) has genus zero and three cusps. There are similarly explicit formulas for the 
genus of Xo(N) — see Shimura 1971, p25. 

Exercise 2.24 Let G be a group (possibly infinite) acting transitively on a set X, and 
let H he a normal"' subgroup of finite index in G. Fix a point xq in X and let Go be the 
stabilizer of in G, and let Hq be the stabilizer of Xq in //. Prove that the number of orbits 
of H acting on X is 

(G : H)/(Go : Ho). 
Deduce that the number of inequivalent cusps for r(N) is fiN / ^■ 

Remark 2.25 Recall that Liouville's theorem states the image of a nonconstant entire 
function (holomorphic function on the entire complex plane C) is unbounded. The Little 
Picard Theorem states that the image of such a function is either C or C with one point 
omitted. We prove this. 

In Example 2.23, we showed that X(2) has genus zero and three cusps, and hence it is 
isomorphic to C \ {two points}. Therefore an entire function / that omits two values can be 
regarded as a holomorphic function / : C ^ X(2). Because C is simply connected, / will 
lift to a function to the universal covering of X(2), which is the open unit disk. The lifted 
function is constant by Liouville's theorem. 

Remark 2.26 The Taniyama-Weil conjecture says that, for every elliptic curve E over Q, 
there exists a surjective map Xo(N) — ^ E, where A'^ is the conductor of E (the conductor of 
E is divisible only by the primes where E has bad reduction). The conjecture is suggested by 
studying zeta functions (see later). For any particular A'^, it is possible to verify the conjecture 
by listing all elliptic curves over Q with conductor A^, and checking that there is a map 
XoiN) — ^ £. It is known (Frey, Ribet) that the Taniyama-Weil conjecture implies Fermat's 
last theorem. Wiles (and Taylor) proved the Taniyama-Weil conjecture for sufficiently many 

'"The exercise becomes false without this condition. For example, take H = Foip) and xq = oo; then 
[G.H] = p+l and [Goo: ^^oo] = 1> ™d the formula in the exercise gives us p+l for the number of inequivalent 
cusps for Poip). However, we know that Foip) has only 2 cusps. 
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elUptic curves to be able to deduce Fermat's last theorem, and the proof of the Taniyama-Weil 
conjecture was completed for all elliptic curves over Q by Breuil, Conrad, Diamond, and 
Taylor. See: Darmon, Henri, A proof of the full Shimura-Taniyama-Weil conjecture is 
announced. Notices Amer. Math. Soc. 46 (1999), no. 11, 1397-1401. 

An eUiptic curve for which there is a nonconstant map Xo{N) E for some A'^ is called 
a modular elliptic curve; contrast elliptic modular curves which are the curves of the form 
F\EI* for r a congruence subgroup of r{\). 

Aside 2.27 Adomain is a comected open subset of some space C". Abounded symmetric domain 

X is a bounded domain that is symmetric in the sense that each point of X is an isolated fixed point 
of an involution of X (holomorphic automorphism of X of order 2). A complex manifold isomorphic 
to a bounded symmetric domain is called a hermiUan symmetric domain (or, loosely, a bounded 
symmetric domain). 

For example, the unit disk D is a bounded symmetric domain — the origin is the fixed point of the 
involution z i-> — z, and, since Aut(Z)) acts transitively on D, this shows that every other point must 
also be the fixed point of an involution. As H is isomorphic to D, it is a hermitian symmetric domain. 
Every hermitian symmetric domain is simply connected, and so (by the Uniformization Theorem) 
every hermitian synunetric domain of dimension one is isomorphic to the complex upper half plane. 

The hermitian symmetric domains of all dimensions were classified by Elie Cartan, except for 
the exceptional ones. Just as for H, the group of automorphisms A\xt{X) of a hermitian symmetric 
domain X is a Lie group, which is simple if X is indecomposable (i.e., not equal to a product of 
hermitian symmetric domains). There are hermitian symmetric domains attached to groups of type 
An (n > 1), B„ (n > 2), C„ (n > 3), £)„ (« > 4), £5, £7. 

Let X be a hermitian symmetric domain. There exist many semisimple algebraic groups G 
over Q and surjective homomorphisms G(IR)"'" -> Aut(X)+ with compact kemel — the + denotes the 
identity component for the real topology. For example, we saw above that every quaternion algebra 
over Q that splits over M gives rise to such a group for H. Given such a G, one defines congruence 
subgroups r C G(Z) just as for SL2(Z), and studies the quotients. 

In 1964, Baily and Borel showed that each quotient r\X has a canonical structure as an algebraic 
variety; in fact, they proved that r\X could be realized in a natural way as a Zariski-open subvariety 
of a projective algebraic variety r\X* . 

Various examples of these varieties were studied by Poincare, Hilbert, Siegel, and many others, 
but Shimura began an intensive study of their arithmetic properties in the 1960s, and they are now 
called Shimura varieties. 

Given a Shimura variety r\X, one can attach a number field E to it, and prove that the Shimura 
variety is defined, in a natural way, over E. Thus one obtains a vast array of algebraic varieties 
defined over number fields, all with very interesting arithmetic properties. In this corpse, we study 
only the simplest case. 
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In this section, we review some of the theory of elliptic functions. For more details, see 
Cartan 1963, V 2.5, VI 5.3, or Milne 2006, III 1,2. 

Lattices and bases 

Let coi and 0)2 be two nonzero complex numbers such that r = (Wi/(W2 is imaginary. By 
interchanging coi and C02 if necessary, we can ensure that t = coi/a)2 lies in the upper half 
plane. Write 



so that A is the lattice generated by coi and a>2- We are interested in A rather than the basis 
{coi , (02}- A second pair of elements of A, 



Therefore, the ordered bases {q)[,co2) of A with ^s(co[/(i)2) > are those of the form 



Any parallelogram with vertices zq, zq + coi, zq + co\ + 0)2, zo + f^2> where {ft)i,(i;2} is a 
basis for A, is called Si fundamental parallelogram for A. 

Quotients of C by lattices 

Let yl be a lattice in C (by which I always mean a full lattice, i.e., a set of the form Zwi + ^,0)2 
with 0)1 and 0)2 linearly independent over R). We can make the quotient space C/ A into a 
Riemann surface as follows: let g be a point in C and let P be its image C/ A; then there 
exist neighbourhoods V of Q and U of P such that the quotient map p:C ^ C/ A defines 
a homeomorphism V ^ U; we take every such pair (U, : V ^ U) to he a coordinate 
neighbourhood. In this way we get a complex structure on C/A having the following 
property: the map p:C ^ C/ A is holomorphic, and for every open subset U of C/A, a 
function f:U ^ C is holomorphic if and only if f o p is holomorphic on p~^(U). 

Topologically, C/ A (M/Z)^, which is a single-holed torus. Thus C/A has genus 
1. All spaces C/A are homeomorphic, but, as we shall see, they are not all isomorphic as 
Riemann surfaces. 

Doubly periodic functions 

Let 71 be a lattice in C. A meromorphic function / (z) on the complex plane is said to be 
doubly periodic with respect to A if it satisfies the functional equation: 



A = 1^0)1 + Za»2, 



a)[= ao)i+bo)2. 0)2 = co}\ + do)2, a,b,c,d eZ, 
is a basis for A if and only if det ( ^ ^ ) = it 1 . The calculation on p6 shows that the ratio 

/ def / / / 1 

r = a»j/a»2 has imagmary part 





f{z + a)) = f (z) for every o) € A. 
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Equivalently, 

( /(z + coi) = /(z) 
I /(Z+CO2) = /(z) 

for some basis {coi , 0)2} for A. 

Proposition 3.1 Let / (z) be a doubly periodic function for A, not identically zero, and 
let D be a fundamental parallelogram for A such that f has no zeros or poles on the 
boundary of D . Then 

(a) Ep6i)Ordp(/) = 0; 

(b) EpeDRes/>(/) = 0; 

(c) Ep6Dordp(/)-P = 0(modA). 

The second sum is over the points of D where f has a pole, and the other sums are over the 
points where it has a zero or pole. Each sum is finite. 

Proof. Regard / as a function onC/A. Then (a) and (b) are special cases of (a) and (b) 
of Proposition 1.12, and (c) is obtained by applying (1.12b) to z • f'(z)/f(z). □ 

Corollary 3.2 A nonconstant doubly periodic function has at least two poles. 

Proof. A doubly periodic function that is holomorphic is bounded in a closed period 
parallelogram (by compactness), and hence on the entire plane (by periodicity); so it is 
constant, by Liouville's theorem. A doubly periodic function with a simple pole in a period 
parallelogram is impossible, because by (3.1a) the residue at the pole would be zero, and so 
the function would be holomorphic. □ 

Endomorphisms otC/ A 

Note that yl is a subgroup of the additive group C, and so C/7l has a natural group structure. 

Proposition 3.3 Let A and A' be two lattices in C. An element a e C such thataA c A' 
defines a holomorphic map 

fa-C/A^C/A', [z]^[az], 
sending [0] to [0], and every such map is of this form (for a unique a). In particular, 

ZcEnd(C/yl). 

Proof. It is obvious that a defines such a map. Conversely, let (p:C/ A ^ C/ A' be a 
holomorphic map such that V'CIO]) = [0]- Then C is the universal covering space of both 
C/A and C/A', and a standard result in topology shows that (p lifts to a continuous map 
^: C ^ C such that ^(0) = 0: 

C > C 



C/A > C/A'. 
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Because the vertical maps are local isomorphisms, (p is automatically holomorphic. For any 
0) e A, the map z (p{z + 0)) — (p{z) takes values in A' c C. It is a continuous map from 

connected space C to a discrete space A' , and so it must be constant. Therefore (p' ^ is 
doubly periodic function, and so defines a holomorphic function £./ A—^ which must be 
constant (because C/7l is compact), say (p'{z) — a. Then (p{z) — az + ^, and the fact that 
^(0) — implies that j6 = 0. □ 

Corollary 3.4 Every holomorphic map (p:C/ A C/ A' such that(p{0) — Oisahomo- 
morphism. 

Proof. Clearly [z] [az] is a homomorphism. □ 

Compare this with the result (AG, 7.14): every regular map f.A^A' from an abelian 
variety A to an abelian variety A' such that (p(0) = is a homomorphism. 

Corollary 3.5 The Riemann surfaces C/ A and C/ A' are isomorphic if and only if 
A' — a A for some a eC^ . 

Corollary 3.6 Either End(C/yl) = Z or there is an imaginary quadratic field K such 
that R is a sabring of Ok of rank 2 over Z. 

def 

Proof. Write A = Zcoi + Za>2 with r = (Wi/(W2 e H, and suppose that there exists an 
a € C, a ^ Z, such that aA c A. Then 

a(0\ = acoi + b(i>2 
aa)2 = cco\ + d(02, 

with a,b,c,d elj. On dividing through by 0)2 we obtain the equations 

ax = ax + b 
a = or + d. 

Note that c is nonzero because 0; ^ Z. On eliminating a from between the two equations, we 
find that 

cr^ + (d — a)r — b = 0. 

Therefore Q[t] is of degree 2 over Q. On eliminating r from between the two equations, we 
find that 

a'^ — {a + d)a + ad — be — 0. 
Therefore a is integral over Z, and hence is contained in the ring of integers of Q[t]. □ 

The Weierstrass p -function 

We want to construct some doubly periodic functions. Note that when G is a finite group 
acting on a set 5, then it is easy to construct functions invariant under the action of G: for 
any function h:S ^ C, the function f(s) = '^g^G ^(S^^) is invariant under G, because 
fig's) = ^g^G^(s's^) = all invariant functions are of this form, obviously. 

When G is not finite, one has to verify that the series converges — in fact, in order to be able 
to change the order of summation, one needs absolute convergence. Moreover, when 5 is a 
Riemann surface and h is holomorphic, to ensure that / is holomorphic, one needs that the 
series converges absolutely uniformly on compact sets. 
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Now let (p{z) be a holomorphic function C and write 

0(z) = + 

IDEA 

Assume that as |z| — ^ oo, (p(z) ^ so fast that the series for (P(z) is absolutely convergent 
for all z for which none of the terms in the series has a pole. Then 0(z) is doubly periodic 
with respect to A; for replacing z by z + coo for some ooo € A merely rearranges the terms 
in the sum. This is the most obvious way to construct doubly periodic functions; similar 
methods can be used to construct functions on other quotients of domains. 

To prove the absolute uniform convergence on compact subsets of such series, the 
following test is useful. 

Lemma 3.7 Let D be a bounded open set in the complex plane and letc>l be constant. 
Suppose that xj/(z,Q)), co e A, is a function that is meromoq)hic in z for each co and which 
satisfies the condition^^ 

\lf{z,mo)i +na)2) = 0((m^ + n^)^'^) asm^ + 11^ ^ oo (2) 

uniformly in z for z in D. Then the series Xlojeyi with finitely many terms which 

have poles in D deleted, is uniformly absolutely convergent in D. 

Proof. That only finitely many terms can have poles in D follows from (2). This condition 
on ^Jf means that there are constants A and B such that 

\xl/{z,m(jOi +nco2)\ < B{rn^ + n^)^'^ 

whenever m-^ + > A. To prove the lemma it suffices to show that, given any £ > 0, there 
is an integer A'^ such that 5 < e for every finite sum 5* = ^ 1 1/' (z,mcoi + 110)2) I in which all 
the terms are distinct and each one of them has m-^ + > 2N'^. Now S consists of eight 
subsums, a typical member of which consists of the terms for which m>n>Q. (There is 
some overlap between these sums, but that is harmless.) In this subsum we have m> N 
and ^ff < Bm^-^'^, assuming as we may that IN'^ > A; and there are at most m + I possible 
values of n for a given m. Thus 

00 

S<J2 Bm-^'{m+\)<BiN^-^' 

m=N 

for a suitable constant Bi, and this proves the lemma. □ 

We know from (3.1) that the simplest possible nonconstant doubly periodic function 
is one with a double pole at each point of A and no other poles. Suppose / (z) is such a 
function. Then f(z) — f (— z) is a doubly periodic function with no poles except perhaps 
simple ones at the points of A. Hence by the argument above, it must be constant, and since 
it is an odd function it must vanish. Thus /(z) is even,'^ and we can make it unique by 
imposing the normalization condition / (z) = z~^ + O(z^) near z = — it turns out to be 
convenient to force the constant term in this expansion to vanish rather than to assign the 

'^The expression /(z) = 0{(p(z)) means that |/(z)| < Ccp{z) for some constant C (independent of z) for 
all values of z in question. 

'^Recall that a function f(x) is even if f(—x) = /(x)and odd if f(—x) = —f(x). 
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zeros of / (z). There is such an / (z) — indeed it is the Weierstrass function p(z) — but we 
can't define it by the method at the start of this subsection because if (p(z) = z~-^, the series 
0(z) is not absolutely convergent. However, if (p(z) = — 2z^^, we can apply this method, 
and it gives p', the derivative of the Weierstrass p-function. Define 



p'(z;A) = p'(z;coi,a)2) = -2 XI 



co€A 

Hence 



(Z-W)3' 



Z2+ ^ {{z-(Oy CO^) 



Theorem 3 . 8 Let Pi,..., Pn and Qi, Q„ be two sets of n >2 points in the complex 
plane, possibly with repetitions, but such that no Pi is congruent to a Qj modulo A. If 
^Pi =J2Qj ™od A, then there exists a doubly periodic function f{z) whose poles are 
the Pi and whose zeros are the Qj with correct multiplicity, and f(z) is unique up to 
multiplication by a nonzero constant. 

Proof. There is an elementary constructive proof. Alternatively, it follows from the 
Riemann-Roch theorem applied toC/ A. □ 

The addition formula 

Consider p(z + z'). It is a doubly periodic function of z, and therefore it is a rational 
function of p and p' . 

Proposition 3.9 There is the following formula: 

l\ p'(z)-p'(z') ]^ 

p(z + z) = -l—— — - -p(z)-p(z). 

4 ( p(z) - p{z') ) 

Proof. Let / (z) denote the difference between the left and the right sides. Its only possible 
poles (in D) are at 0, or itz', and by examining the Laurent expansion of / (z) near these 
points one sees that it has no pole at or z, and at worst a simple pole at z' . Since it is 
doubly periodic, it must be constant, and since / (0) = 0, it must be identically zero. □ 

Eisenstein series 

Write 



GkiA) = X co- 



-2k 

u 

and define Gkiz) = Gic(zZ + Z). 



Proposition 3.10 The Eisenstein series Gk{z), k > \, converges to a holomorphic func- 
tion on H; it takes the value 2^(2k) at infinity. (Here ^(s) = '^n~^, the usual zeta function.) 

Proof. Apply Lemma 3.7 to see that Gk(z) is a holomorphic function on H. It re- 
mains to consider Gyt(z) as z — ^ / oo (remaining in D, the fundamental domain for r(l)). 
Because the series for G^^z) converges uniformly absolutely on D, \imz^iooGk{z) = 
^limz_j.,oo l/(wjz -\-n)^^ . But limz^/oo l/(mz -\-n)^^ = unless m = 0, and so 

lim Git(z)= l/«2^ = 2y"l/«2'^ = 2^(2yt). 

n€Z,n^O n>l □ 
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The field of doubly periodic functions 

Proposition 3.11 The field of doubly periodic functions is just C(p{z),p'(z)), and 

p'izf = 4p(zf-g2p(z)-g3 

where g2 = 6OG2 and = I4OG3. 

Proof. To prove the second statement, define / (z) to be the difference of the left and the 
right hand sides, and show (from its Laurent expansion) that it is holomorphic near and 
take the value there. Since it is doubly periodic and holomorphic elsewhere, this implies 
that it is zero. 

For the first statement, we begin by showing that every even doubly periodic function / 
lies in C(p). Observe that, because f(z) = f(—z), the kth derivative of /, 

/('^)(z) = (-l)V^'^(-^). 

Therefore, if / has a zero of order m at zq, then it has a zero of order m at — zq. On applying 
this remark to 1 //, we obtain the same statement with "zero" replaced by "pole". Similarly, 
because /*^^^+^)(zo) = — f^-^^'^^\—zo), if zq = — zq mod A, then the order of zero (or 
pole) of / at Zq is even. Choose a set of representatives mod A for the zeros and poles of / 
not in A and number them zi, . . .,Zm, — zi, • . . ,— z^, z^+i, ■■■,Zn so that (modulo A) 

Zi ^-Zi, \<i <m 

Zi = —Zi ^0, m <i < n. 

Let trii be the order of / at z, ; according to the second observation, m, is even for i > m. 
Now p(z) — p{zi) is also an even doubly periodic function. Since it has exactly two poles 
in a fundamental domain, it must have exactly two zeros there. When i < m, it has simple 
zeros at itz, ; when / > m, it has a double zero at z, (by the second observation). Define 

g{z) = nr=i(p(2)-p(^/)r'' •nr=^+i(p(z)-pfe)r'/'. 

Then / (z) and g(z) have exactly the same zeros and poles at points z not on A. We deduce 
from (3.1a) that they also have the same order at z = 0, and so f/g, being holomorphic and 
doubly periodic, is constant: f = eg € C(p). Now consider an arbitrary doubly periodic 
function /. Such an / decomposes into the sum of an even and of an odd doubly periodic 
function: 

■'^^^ 2 2 ' 

We know that the even doubly periodic functions lie in C(p), and clearly the odd doubly 
periodic functions he in p' ■ C(p). □ 

Elliptic curves 

Let k he a field of characteristic 2, 3. By an elliptic curve over k, I mean a pair (E,0) 
consisting of a nonsingular projective curve E of genus one and with a point 6 E(k). From 
the Riemann-Roch theorem, we obtain regular functions x and y on E such that x has a 
double pole at and y a triple pole at 0, and neither has any other poles. Again from the 
Riemann-Roch theorem applied to the divisor 6 • 0, we find that there is a relation between 1, 
X, x^, x^ , y, y-^, xy, which can be put in the form 

= 4x^ — ax — b. 
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The fact that E is nonsingular imphes that Zi = — 21 ^ 0. Thus E is isomorphic to the 
projective curve defined by the equation, 

Y^Z = AX^ -aXZ^-bZ^, 

and every equation of this form (with A^O) defines an elliptic curve. Define 

j{E) = \121>a^/A. 

If the elUptic curves E and E' are isomorphic then j{E) — j{E'), and the converse is true 
when k is algebraically closed. If £ is an elliptic curve over C, then E(C) has a natural 
complex structure — it is a Riemann surface. (See Milne 2006 for proofs of these, and other 
statements, about elliptic curves.) 

An elliptic curve has a unique group structure (defined by regular maps) having as its 
zero. 

The elliptic curve E(A) 

Let yl be a lattice in C. We have seen that 

p'(z)2 = 4p(z)^-g2piz)-g3. 
Let E(A) be the projective curve defined by the equation: 

Y^Z = 4X^-g2XZ^-g3Z\ 
Proposition 3.12 The curve E(A) is an elUptic curve (i.e., A ^ 0), and the map 



C/A^E{A), 



z\-^{p(z):p'(z):l), z^O 
^ (0 : 1 : 0) 



is an isomorphism of Riemann surfaces. Every elliptic curve E is isomorphic to E{A) for 
some A . 

Proof. The first statement is obvious. There are direct proofs of the second statement, but 

we shall see in the next section that z A(zZ + Z) is a modular function for F(l) with 
weight 12 having no zeros in H, and that z 7 (zZ + Z) is a modular function and defines a 
bijection F(1)\BI C (therefore every j equals j(A) for some lattice Zz + Z, z e H). □ 

The addition formula shows that the map in the proposition is a homomorphism. 

Proposition 3.13 There are natural equivalences between the following categories: 

(a) Objects: Elliptic curves E over C. 

Morphisms: Regular maps E ^ E' that are homomorphisms. 

(b) Objects: Riemann surfaces E of genus 1 together with a point 0. 
Morphisms: Holomorphic maps E —>■ E' sending to 0'. 

(c) Objects: Lattices vl C C. 

Morphisms: }iom(A,A') — {a e C \ aA c A'}. 

Proof. The functor c ^ b is A C/A. The functor a ^ b is (E,0) iE(C),0), 
regarded as a pointed Riemann surface. □ 
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4 Modular Functions and Modular Forms 
Modular functions 

Let F be a subgroup of finite index in r(l). A modular function for F isa meromorphic 
function on the compact Riemann surface /^\H*. We often regard it as a meromorphic 
function on H* invariant under F. Thus, from this point of view, a modular function / for 
F is a function on H satisfying the following conditions: 

(a) / (z) is invariant under F, i.e., / (yz) = / (z) for all y € F; 

(b) / (z) is meromorphic in H; 

(c) / (z) is meromorphic at the cusps. 

For the cusp ioo, the last condition means the following: the subgroup of F(l) fixing 

ioo is generated by T = (J }) — it is free abelian group of rank 1; the subgroup of F fixing 
/oo is a subgroup of finite index in (T), and it therefore is generated by (q i) for some 
€ N, (/z is called the width of the cusp); as / (z) is invariant under (q ^ ), /(z + /z) = /(z), 
and so /(z) can be expressed as a function f*(q) of the variable q = exp(27r/z/ h); this 
function f*(q) is defined on a punctured disk, < |^| < £, and for / to be meromorphic at 
ioo means /* is meromorphic at ^ = 0. 

For a cusp r ^ ioo, the condition means the following: we know there is an element 
a e F(l) such that r = or (ioo); the function z /(cz) is invariant under aFa~^, and 
/ (az) is required to be meromorphic at i oo in the above sense. 

Of course (c) has to be checked only for a finite set of representatives of the F- 
equivalence classes of cusps. 

Recall that a function / (z) that is holomorphic in a neighbourhood of a point a eC 
(except possibly at a) is holomorphic at a if and only if / (z) is bounded in a neighbourhood 
of a. It follows that /(z) has a pole at a, and therefore defines a meromorphic function 
in a neighbourhood of a, if and only if (z — a)" f (z) is bounded near a for some n, i.e., if 
/ (z) = 0((z — a)~") near a. When we apply this remark to a modular function, we see 
that / (z) is meromorphic at / oo if and only if f*(q) = 0(q~") for some n as ^ ^ 0, i.e., 
if and only if, for some ^ > 0, e^'^ ■ f (z) is bounded as z ^ / oo. 

Example 4. 1 As F(l) is generated by S and T, to check condition (a) it suffices to verify 
that 

/(-l/z) = /(z), /(z + l) = /(z). 
The second equation implies that / = f*(q), q = exp(2niz), and condition (c) says that 

r(q)= Yl 

n>-No 

Example 4.2 The group F(2) is of index 6 in F(l). It is possible to find a set of generators 
for F(2) just as we found a set of generators for F(\), and again it suffices to check 
condition (a) for the generators. There are three inequivalent cusps, namely, ioo, S(ioo) = 
(0 -1) (1) = (0) = 0, and TSiioo) = 1. Note that S(0) = ioo. The stabilizer of ?oo in 
F(2) is generated by (q j), and so /(z) = f*{q), q — exp(27tiz/2), and for /(z) to be 
meromorphic at ioo means /* is meromorphic at 0. For / (z) to be meromorphic at means 
that / (Sz) — /(— 1/z) is meromorphic at ioo, and for / (z) to be meromorphic at 1 means 
that /(I — is meromorphic at / oo. 
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Proposition 4.3 There exists a unique modular function J forr(l) which is holomorphic 
except at ioo, where it has a simple pole, and which takes the values 



Proof. From Proposition 2.21 we know there is an isomorphism of Riemann surfaces 
/ : F(1)\]H* S (Riemann sphere). Write a, b, c for the images of p, i, oo. Then there 
exists a (unique) linear fractional transformation'^ S —>■ S sending a, c to 0, 1, oo, and on 
composing / with it we obtain a function / satisfying the correct conditions. 

If is a second function satisfying the same conditions, then go is an automorphism 
of the Riemann sphere, and so it is a linear fractional transformation. Since it fixes 0, 1, oo it 
must be the identity map. □ 

Remark 4.4 Let j(z) — niSg^/A, as in Section 3. Then j(z) is invariant under r(l) 
because ^| and A are both modular forms of weight 12 (we give all the details for this 
example later). It is holomorphic on H because both of g| and A are holomorphic on H, 
and A has no zeros on H. Because A has a simple zero at oo, j has a simple pole at oo. 
Therefore j(z) has valence one, and it defines an isomorphism from /^\]HI* onto S (the 
Riemann sphere). In fact, j(z) — 1728/(z). 

Modular forms 

Let F be a subgroup of finite index in F(l). 

Definition 4.5 A modular form for F of weight 2k is a function on H such that: 

(a) /(yz) = (cz + df" ■ f(z), all z € M and all y = ^) € T; 

(b) / (z) is holomorphic in H; 

(c) / (z) is holomorphic at the cusps of F. 

A modular form is a cusp form if it is zero at the cusps. 

For example, for the cusp ioo, this last condition means the following: let h be the width 
of ioo as a cusp for F; then (a) implies that f(z + h) = f{z), and so /(z) = f*(q) for 
some function /* on a punctured disk; /* is required to be holomorphic a.tq = 0. 

Occasionally we shall refer to a function satisfying only (4.5a) as being weakly modular 
of weight 2k, and a function satisfying (4.5a,b,c) with "holomorphic" replaced by "meromor- 
phic" as being a meromorphic modular form of weight 2k. Thus a meromorphic modular 
form of weight is a modular function. 

As our first examples of modular forms, we have the Eisenstein series. Let C be the set 
of lattices in C, and write A{a)\,(i)2) for the lattice Zwi + Za;2 generated by independent 
elements (Wi, 002 with "^^{001/ 0)2) > 0. Recall that A{u)'-^^,(jo'2) = A(cl)i,(j)2) if and only if 



'■'Recall (p30) that the linear fractional transformations of P^(C) are the maps z cf+¥ witha,ft.c,J e Z, 
ad — be ^ 0. For any distinct triples (z\, 22,23) and (Zj , Z2, Z3) of distinct points in (C), there is a unique 
linear fractional transformation sending zi to Zj, Z2 to z'2, and Z3 to Z3 (see the Wikipedia). 



J{i) = 1 , /(p) = 0. 
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Lemma 4.6 Let F:C^Cbea function of weightlk, i.e., such that F{XA) = A"^'^ ■ F{A) 

def 

for X e . Then f(z) = F(A(z, 1)) is a weakly modular form on H of weight 2k and 
F f is a bijection from the functions of weight 2k on C to the weakly modular forms of 
weight 2k on M. 

Proof. Write F((jOi , a>2) for the value of F at the lattice A(a)i , 0)2). Then because F is of 
weight 2k, we have 

F(Xo)i , Xo)2) = A~^^ ■ F(o)i , C02) , A e , 

and, because F {0)1,0)2) depends only on A(o)i,o)2), it is invariant under the action of 
SL2(Z) : 

F{ao)i+bu)2,cu)\ + do)2) = F{o)i,o)2),3i^ e SL2(Z). (3) 

The first equation shows that a;|^ ■ F {0)1,0)2) is invariant under {o)\,o)2) 1-^ {Xo)\,Xo)2), 
X e C^, and so depends only on the ratio 0)1/0)2; thus there is a function / (z) such that 

F{o)i,o)2) = o)2^^-f{(Oi/a)2). (4) 

When expressed in terms of /, (3) becomes 

{ccoi + doi)2)^'^^ ■ f{ao)i + bo)2/co)i + do)2) = 0)2'^^ ■ /(o>i/o>2), 

or 

{cz + d)^^'' ■ f{az + b/cz + d) = f{z). 

This shows that / is weakly modular Conversely, given a weakly modular /, define F by 
the formula (4). □ 

Proposition 4.7 The Eisenstein series G]^{z), k > I, is a modular form of weight 2k for 
F{1) which takes the value 2^ {2k) at infinity. 

Proof. Recall that we defined Gk{A) = T,co€A,co^o^/^^'' ■ Clearly, Gk{XA) = X-^''Gk{A), 
and therefore 

Gk{z)= Gk{A{z,l))= J2 Uimz + n)^'' 

is weakly modular. That it is holomorphic on IHI and takes the value 2^ {2k) at ioo is proved 
in Proposition 3.10. □ 

Modular forms as k-iold differentials 

The definition of modulai" form may seem strange, but we have seen that such functions 
arise naturally in the theory of elliptic functions. Here we give another explanation of the 
definition. For the experts, we shall show later that the modular forms of a fixed weight 2k 
are the sections of a line bundle on FyiHI*. 
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Remark 4.8 Consider a differential co = f (z) ■ dz on M, where / (z) is a meromorphic 
function. Under what conditions on / is invariant under the action of /^? Let y(z) = ; 
then 

y w = f(yz)-d — 

cz + a 

^ (a(cz + t/)-c(az + /))) 

= ^^^^^ (^^^ 

= f(yz)-(cz + d)~'^ -dz. 

Thus ft) is invariant if and only if / (z) is a meromorphic differential form of weight 2. We 

have one-to-one correspondences between the following sets: 
{meromorphic modular forms of weight 2 on IHI for F} 
{meromorphic differential forms on H* invariant under the action of F} 
{meromorphic differential forms on F\]HI*}. 

There is a notion of a k-fold differential form on a Riemann surface. Locally it can be 
written co = f{z)- (dz)^ , and if w = w(z), then 

w*(v = f(w(z)) ■ (dw(z)f = f{w(z)) ■ w'izf ■ (dzf. 

Then modular forms of weight 2k correspond to F -invariant /c-fold differential forms on H*, 
and hence to meromorphic A: -fold differential forms on /^\]HI*. Warning: these statements 
don't (quite) hold with meromorphic replaced with holomorphic (see Lemma 4.1 1 below). 

We say that co = f{z)- {dz)^ has a zero or pole of order m at z = according as / (z) 
has a zero or pole of order m at z = 0. This definition is independent of the choice of the 
local coordinate near the point in question on the Riemann surface. 

The dimension of the space of modular forms 

For a subgroup F of finite index in F(l), we write M.k{r) for the space of modular forms 
of weight 2k for F , and Sk (F) for the subspace of cusp forms of weight 2k. They are vector 
spaces over C, and we shall use the Riemann-Roch theorem to compute their dimensions. 

Note that A^o(^) consists of modular functions that are holomorphic on IHI and at the 
cusps, and therefore define holomorphic functions on F\]HI*. Because F\]HI* is compact, 
such a function is constant, and so A^o(^) = C. The product of a modular form of weight 
k with a modular form of weight £ is a modular form of weight k + i. Therefore, 

^(n = ®,^^Mk(F) 

is a graded ring. The next theorem gives us the dimensions of the homogeneous pieces. 
Theorem 4.9 The dimension ofMk(F) is given by: 



dimiMkiF)) 



f ifk < -1 

1 ifk = 

(2/c-l)(g-l) + Voo^ + i:p[^(l-i7)] ifk>l 



where g is the genus ofX{F) (= F\m*); 
is the number of inequivalent cusps; 
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the last sum is over a set of representatives for the the elliptic points P of F; 
e p is the order of the stabilizer of P in the image F of F in F(l)/{±I}; 
[k(l — l/ep)] is the integer part ofk(l — l/ep). 

We prove the result by applying the Riemann-Roch theorem to the compact Riemann 
surface F\]HI*, but first we need to examine the relation between the zeros and poles of a 
F-invariant k-fo\d differential form on H* and the zeros and poles of the corresponding 
modular form on F\]HI*. It will be helpful to consider first a simple example. 

Example 4.10 Let D be the unit disk, and consider the map w: D ^ D, z z'^ . Let 
Q P . If Q ^ 0, then the map is a local isomorphism, and so there is no difficulty. Thus 
we suppose that P and Q are both zero. 

First suppose that / is a function on D (the target disk), and let /* = f °w. If / 
has a zero of order m (regarded as function of w), then /* has a zero of order em, for 
if f{w) = aw"^ + terms of higher degree, then /(z^) = az^"^ + terms of higher degree. 
Thus 

ordg(/*) = e.ordp(/). 

Now consider a ^-fold differential form <y on D, and let = w*{co). Thenty = f(z)-(dz)^ 
for some /(z), and 

CO* = f(z^) . (dz^f = f(z^) . (ez^-i • dzf = ■ f{z') ■ z^(^-i) • (dzf. 

Thus 

ordQ((W*) = e or d p (co) + k(e — 1). 

Lemma 4.11 Let f be a (meromorphic) modular form of weight 2k, and let co be the 
corresponding k-fold differential form on F\M.*. Let Q eB* map to P e F\B*. 

(a) If Q is an elUptic point with multiplicity e, then 

ordQ(/) = eoTdp(cL>) + k{e — 1). 

(b) If Q is a cusp, then 

ordg (/) = ordp ((w) + k. 

(c) For the remaining points, 

ordQ(/) = ordp((w). 

Proof. Let p be the quotient map H F\M. 

(a) We defined the complex structure near P so that, for appropriate neighbourhoods V 
of Q and U of P, there is a commutative diagram: 

V > D 

P 



Thus this case is isomorphic to that considered in the example. 
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(b) Consider the map q:M ^ (punctured disk), q(z) = exp(2jTiz/ h), and let = 
giq) ■ (dq)'^ be a /:-fold differential form on the punctured disk. Then dq = {2ni/ h)-q ■ dz, 
and so the inverse image of at* on H is 

(o = ( cnst) • g(q{z))-q(zf ■ (dzf , 

and so (o* corresponds to the modular form / (z) = (cnst) • g(q(z)) ■q{z)'^. Thus f*(q) — 
g(q) ■ q^ , which gives our formula. 

(c) In this case, p is, & local isomorphism near Q and P , and so there is nothing to 
prove. □ 

We now prove the theorem. Let / 6 M.]^{r), and let o) be the corresponding /c-fold 
differential on F\E[*. Because / is holomorphic, we must have 

eovAp{a)) + k{e—\) = ordg(/) > at the image of an elliptic point; 
ordp (&;) + /: = ordg(/) > at the image of a cusp; 
ord p{(X>) = ordg (/) > at the remaining points. 

Fix a A: -fold differential coq, and write a> = h-coo- Then 

ord p(h) + ord p((jOo) + k (I — I /e) >0 at the image of an elliptic point; 
ordp(/i) + ordp((Wo) + ^ >0 at the image of a cusp; 
ord p (h) + ord p (coq) > at the remaining points. 

On combining these inequalities, we find that 

div(h) + D > 0, 

where 

D = div(«o) + ■ Pi + J2^k(l - 1/e,)] • Pi 

(the first sum is over the images of the cusps, and the second sum is over the images of 
the elliptic points). As we noted in Corollary 1.21, the degree of the divisor of a 1-fold 
differential form is 2g — 2; hence that of a /c-fold differential form is k{2g — 2). Thus the 
degree of D is 

k(2g-2) + v^-k + J2[k(i-'^/ep)]. 
p 

Now the Riemann-Roch Theorem (1.22) tells us that the space of h's has dimension 

l-g + /:(2g-2) + yoo-^ + ^[/:(l-l/ep)] 

P 

for A; > 1. As the h's are in one-to-one correspondence with the holomorphic modular forms 
of weight 2k, this proves the theorem in this case. For k = 0, we have already noted that 
modular forms are constant, and for A: < it is easy to see that there can be no modular 
forms. 
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Zeros of modular forms 

Lemma 4.1 1 allows us to count the number of zero and poles of a meromorphic differential 
form. 

Proposition 4.12 Let f be a (meromorphic) modular form of weight 2k; then 

J2(ordQ(f)/eQ-kil - l/eQ)) = k(lg - 2) + /c ■ v^o 

where the sum is over a set of representatives for the points in r\M* , v^o is the number of 
inequivalent cusps, and cq is the ramification index of Q over p(Q) if Q e H and it is 1 if 
Q is a cusp. 

Proof. Let co be the associated /r-fold differential form on F\]HI*. We showed above 
that: ordg(/)/eg = ordp((y) + k(l — 1/eg) for Q an elliptic point for F; ordg(/) = 
ordp (ft)) + k for Q a cusp; ordg (/) = ord/> (at) at the remaining points. On summing these 
equations, we find that 

^ordQ(f )/eQ -k(l- I /eg) = deg(div(ct>)) + ^ • Voo, 
and we noted above that deg(div(&))) = k(2g — 2). □ 
Example 4. 13 When F — r{i), this becomes 

ord,oo(/) + ^ord,(/) + ^ordp(/) + ^ordgC/) = -2k + k + h + h = 

Here / oo, i, p are points in H* , and the sum ^ is over the remaining points in a fundamental 
domain. 

Modular forms for r(l) 

We now describe all the modular forms for F(l). 

Example 4. 14 On applying Theorem 4.9 to the full modular group r(l), we obtain the 
following result: = for /: < 0, dim A4o = I, and 

dim Mk = i-k + [k/2] + [2k/3], k>l. 

Thus 

k =1234567 
dimMk = 0111121.... 

In fact, when k is increased by 6, the dimension increases by 1. Thus we have 

(a) Mk = Ofor/t <0; 

(b) dim(X^) = [k/6] if k = I mod 6; [k/6] + 1 otherwise; ^ > 0. 

Example 4. 15 On applying (4.13) to the Eisenstein series G^, k > \, we obtain the 
following result: 

k = 2: G2 has a simple zero at z = p, and no other zeros. 

k — 3: G3 has a simple zero at z = / , and no other zeros. 

k — 6: because A has no zeros in H, it has a simple zero at 00. 
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There is a geometric explanation for these statements. Let A = A(i, 1). Then iA C A, 
and so multiplication by / defines an endomorphism of the torus C/ A. Therefore the eUiptic 
curve 

= 4X^ - g2{A)X - gsiA) 
has complex multiplication by / ; clearly the curve 

has complex multiplication by / (and up to isogeny, it is the only such curve); this suggests 
that g3(A) = 0. Similarly, G2(A) = "because" Y^ = X^ + I has complex multiphcation 
by -yi. Finally, if A had no zero at oo, the family of elliptic curves 

Y^ ^ 4X^ - g2(A)X - g3(A) 

over F(1)\]HI would extend to a smooth family over F(1)\]HI*, and this is not possible 
for topological reasons (its cohomology groups would give a nonconstant local system on 
F(1)\IHI*, but the Riemann sphere is simply connected, and so admits no such system). 

Proposition 4.16 (a) Fork < 0, andk = I, Mk = 0. 

(b) For k = 0,2, 3,4, 5, Ai^ is a space of dimension 1, admitting as basis 1, G2, G3, G4, 
G5 respectively; moreover Skif) = for < A: < 5. 

(c) Multiplication by A defines an isomorphism ofAik-e onto S^. 

(d) The graded k-algebra = C[G2,G3] with G2 and G3 of weights 2 and 3 
respectively. 

Proof, (a) See (4. 14). 

(b) Since the spaces are one-dimensional, and no is identically zero, this is obvious. 

(c) Certainly / fA is a homomorphism Aik-e ^k- But if / e Sk, then f /A e 
M.k-6 because A has only a simple zero at / oo and / has a zero there. Now / \->- f/Ais 
inverse to / fA. 

(d) We have to show that {G^ • G" \2m + 3n = k, m e'M, n e'N} forms a basis for A^^t- 
We first show, by induction on k, that this set generates Ai^. For A: < 3, we have already 
noted it. Choose a pair m >0 and « > such that 2m + 3n = k (this is always possible for 
k >2). The modular form g — G^ • G" is not zero at infinity. If / € Ai^, then / — 

is zero at infinity, and so is a cusp form. Therefore, it can be written A ■ h with h e Ai k-e^ 
and we can apply the induction hypothesis. 

Thus C[G2, G3] ®A4k is surjective, and we want to show that it is injective. If not, 
the modular function G|/G| satisfies an algebraic equation over C, and so is constant. But 
G2(p) = 0^ Gsip) whereas G2(0 7^ = G3(0- □ 

Remark 4. 17 We have verified all the assertions in (4.3). 

The Fourier coefficients of the Eisenstein series for 7^(1) 

For future use, we compute the coefficients in the expansion Gk(z) = ^anq". 
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The Bernoulli numbers B]^ 

They are defined by the formal power series expansion: 

oo 2k 

k=i ^ ' 

Thus Bi = 1/6; B2 = 1/30; ... ; B14 = 23749461029/870; ... Note that they are all rational 
numbers. 

Proposition 4.18 For all integers k >l, 

22/t-i 
' (2k)\ " 



Proof. Recall that (by definition) 

cos(z) = — , sin(z) — 



2i 

Therefore, 

cot(z ) = / —. — = i -—. = i H — — . 

On replacing x with 2iz in the definition of the Bernoulli numbers, we find that 

°° 22k^2k 

z cot(z) = 1 - y —r— (5) 

There is a standard formula 

sin(z) = z 



^n('-i 

n = \ 



(see Cartan 1963, V 3.3). On forming the logarithmic derivative of this (i.e., forming 
d log(/) = f / f) and multiplying by z, we find that 

E°° 2z-^/n^7i-^ 
9,99 

n = l ' 



00 - 



2/. 2 

00 00 2k 



n = l ' 



n=lk=l 
00/00 \ ^2k 



k = l \n = l 



-2k 



Z 



On comparing this formula with (5), we obtain the result. 

For example, ^(2) = ^, ^(4) = t(6) = ... 



Remark 4.19 Until 1978, when Apery showed that f (3) is irrational, almost nothing was 
known about the values of t, at the odd positive integers. 
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The Fourier coefeicients of Gk 
For any integer n and number k, we write 

d\n 

Proposition 4.20 For every integer k > 2, 

^ '' n=l 

Proof. In the above proof, we showed above that 

oo 2 

zcot(z) = 1 + 2^ 



n=l 



and so (replace z with nz and divide by z) 

00 



7rcot(;rz) = - + 2 > — = - + > + 

z ■^^z^ — «^ z ^—'\z + n z — nl 
n=l n=l ^ ^ 



Moreover, we showed that 
and so 

:7r cot(:7rz) = jrz — 



\-q 

oo 

= jrz — 2:7rz ^ ^" 



n=i 



where ^ = e^^^^ . Therefore, 

-+> ^ ]=JTi-27tiy q". 

z ^Vz + « z-n/ 

n=l ^ ^ n=l 



z 

The (A: - l)th derivative of this (k > 2) is 



Now 



1 1 °° 

i^^{n + zf ik-iy} 



(n,mtr(o,0)("^ + '") 

oo ^ 
= 2^(2yt) + 2 W -y 

^ n=la=l 

= 2m) + ^^j:^2k-dnyq". 

n=l 
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The expansion of A and j 

Recall that 

From the above expansions of G2 and G3, one finds that 

Theorem 4.21 (Jacobi) A = {2n)^^q]X^^^{\-q'' f'^, q = e^^'^. 

Proof. Let f(q) = q Hn^i ~ q")'^^- The space of cusp forms of weight 12 has dimen- 
sion 1. Therefore, if we show that /(— 1/z) = z^^/(z), then / will be amultiple of A. It is 
possible to prove by an elementary argument (due to Hurwitz), that /(—1/z) and z^-^f(z) 
have the same logarithmic derivative; therefore 

/(-l/z) = Czi2-/(z), 

some C. Put z = 1 to see C = 1. See Serre 1970, VII.4.4, for the details. □ 

Write q ]^(1 — q")-^'^ = YlT=i ^('^) ' "^^^ function n 1-^ T(n) was studied by Ra- 
manujan, and is called the Ramanujan x-f unction. We have 

t(1) = 1 , t(2) = -24 , r(12) = -370944 

Evidently each r(«) e Z. Ramanujan made a number of interesting conjectures about x{n), 
some of which, as we shall see, have been proved. 

Recall that j{z) = A = gl-21gl g2 = 6OG2, ^3 = UOGb. 

Theorem 4.22 The function 

j{z) = - + 744+ 196884^ + 21 493 760^2 + c(3)^^ + c(4)^'* + ---, q = e^'''\ 

q 

where c(n) for all n . 

Proof. Immediate consequence of the definition and the above calculations. □ 
The size of the coefficients of a cusp form 

Let /(z) = be a cusp form of weight 2k > 2 for some congruence subgroup of 

SL2(Z). For various reasons, for example, in order to obtain estimates of the number of 
times an integer can be represented by a quadratic form, one is interested in | . 

Hecke showed that a„ = 0(n^) — the proof is quite easy (see Serre 1970, VII.4.3, for 
the case of r{i)). Various authors improved on this — for example, Selberg showed in 1965 
that an = 0{n^^'^l'^+'^) for all e > 0. It was conjectured that a„ = 0{n^^'^l'^ ■ ao(n)) (for 
the r-function, this goes back to Ramanujan). The usual story with such conjectures is that 
they prompt an infinite sequence of papers proving results converging to the conjecture, 
but (happily) in this case Deligne proved in 1969 that the conjecture follows from the Weil 
conjectures for varieties over finite fields, and he proved the Weil conjectures in 1973. See 
(11.16) below. 
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Modular forms as sections of line bundles 

Let Z be a topological manifold. A line bundle on Z is a map of topological spaces 
tt: L — ^ X such that, for some open covering X = [jUi of X, Tt~^ (Uj) ^ Ui x M. Similarly, 
a line bundle on a Riemann surface is a map of complex manifolds n:L X such that 
locally L is isomorphic to J7 x C, and a line bundle on an algebraic variety is a map of 
algebraic varieties n:L X such that locally for the Zariski topology on X, L U xA^. 

If L is a line bundle on X (say a Riemann surface), then for any open subset U of 
X, r(U,L) denotes the group of sections of L over U, i.e., the set of holomorphic maps 
f:U—>^L such that tz o f = identity map. Note that if L = J7 x C, then FilJ, L) can be 
identified with the set of holomorphic functions on U. (The F in r{U, L) should not be 
confused with a congruence group F .) 

Now consider the following situation: F is a group acting freely and properly discon- 
tinuously on a Riemann surface H , and X = F\H . Write p for the quotient map H ^ X. 
Let tt: L ^ Z be a Une bundle on X; then 

p*{L) = {(h, l)cHxL\ p{h) = Tt{l)} 

is a line bundle on H (for example, p*(X xL) = H x L), and F acts on p*(L) through 
its action on H. Suppose we are given an isomorphism i:H xC ^ p* (L). Then we can 
transfer the action of F on p*(L) to an action of T on // x C over H. For y e F and 
(t, z)€ H xC, write 

y(r,z) = (Yr,jy(r)z) , jy(r) e C". 

Then 

yy'(T,z) = y(y'r,y>(T)z) = {yY'r,jy{y'r)-jy'{r)-z). 

Hence: 

jyy'ix) = jy{y'x)-jy{x). 

Definition 4.23 An automorphy factor is a map j:F xM.^C^ such that 

(a) for each y e t i-^ 7y (r) is a holomorphic function on H; 

(b) jyy'{r) = jy{Y'x)-jy'{r). 

Condition (b) should be thought of as a cocycle condition (in fact, that's what it is). Note 
that if j is an automorphy factor, so also is for every integer k. 

Example 4.24 For every open subset H oiC with a group F acting on it, there is canonical 
automorphy factor jy(r), namely, 

FxH^C,(y,r)^(dy)r. 

By (dy)T: I mean the following: each y defines a map H ^ H, and (dy)z is the map on the 
tangent space at r defined by y. As // C C, the tangent spaces at r and at yr are canonically 
isomorphic to C, and so (dy)t can be regarded as a complex number. 
Suppose we have maps 




of (complex) manifolds, then for any point m e M, {d(fioa))m = {d^)a{n) ° ida)m (maps 
on tangent spaces). Therefore, 



jyyix) = {dyy')r = {dy)y>r-{dy')r = jyiy'r)- jyir). 



60 



I. The Analytic Theory 



def 

Thus 7y(r) = (dy)-,: is an automorphy factor. 

For example, consider F(l) acting on H. If y = (z cz+d ^' ^^^^ 

and so jyii) = (cz + d)~^, and jyir)'^ = (cz + d)"^^ . 

Proposition 4.25 There is a one-to-one correspondence between the set of pairs (L,i) 
where L is a line bundle on r\H and i is an isomorphism H xC p* (L) and the set of 
automorphy factors. 

Proof. We have seen how to go (L , ; ) jy (t). For the converse, use i and j to define an 
action oi F on H xC, and define L to he r\H xC □ 

Remark 4.26 Every line bundle on H is trivial (i.e., isomorphic to IHI x C), and so Proposi- 
tion 4.25 gives us a classification of the line bundles on r\E.. 

Let L be a line bundle on X. Then 

r(X, L) = {F e r(H, p*L)\F commutes with the actions of T}. 

Suppose we are given an isomorphism p* L ^ H xC We use it to identify the two line 
bundles on H. Then F acts on // x C by the rule: 

y(T,z) = (yr,7y(r)z). 

A holomorphic section F:H—^Hx£. can be written F(r) = (r, / (r)) with / (t) a holo- 
morphic map H —^C. What does it mean for F to commute with the action of F? We must 
have 

F{yx) = yF{t) , i.e. , (yT,/(yr)) = (yr, jy(T)f(r)). 

Hence 

f(Yr) = jy{T)-f(T). 

Thus, if L]^ is the line bundle on F\M corresponding to jy{x)~^, where jy{r) is the 
canonical automorphy factor (4.24), then the condition becomes 

f{yr) = {cz + df^-f{T), 

i.e., condition (4.5a). Therefore the sections of Ly^ are in natural one-to-one correspondence 
with the functions on IHI satisfying (4.5a,b). The line bundle L]^ extends to a line bundle on 
the compactification /^\IHI*, and the sections of are in natural one-to-one correspondence 
with the modular forms of weight 2k. 



Poincare series 

We want to construct modular forms for subgroups F of finite index in r'(l). Throughout, 
we write F' for the image of F in F(l)/{ib/}. 

Recall the standard way of constructing invariant functions: if /? is a function on H, then 

/(z)'=' ^/j(yz) 
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is invariant under F, provided the series converges absolutely (which it rarely will). Poincare 
found a similar argument for constructing modular forms. 
Let 

be an automorphy factor for F; thus 

jyy'{z) = jy(Y'z)-jy>{z). 

Of course, we shall be particularly interested in the case 

7•,(z) = (cz + Jf^y = («^). 

We wish to construct a function / such that f(yz) = jy{z)- f(z). 
Try 

If this series converges absolutely uniformly on compact sets, then 
as wished. 

Unfortunately, there is little hope of convergence, for the following (main) reason: there 
may be infinitely many y 's for which jy (z) = 1 identically, and so the sum contains infinitely 
many redundant terms. Let 

Fo = {y e F' \ jy(z) = 1 identically}. 

For example, if jy(z) = (cz + d)~^^, then 

where h is the smallest positive integer such that ( J ^) e F (thus h is the width of the cusp 
ioo for F). In particular, Fq is an infinite cyclic group. 
If y, y' e ^Q, then 

ivY' (2) = jy (y'^) • Jy' (2) = 1 (all z) > 

and so Fq is closed under multiplication — ^in fact, it is a subgroup of F'. 

Let hhea holomorphic function on H invariant under Fq, i.e., such that h(Yoz) = h(z) 
for all Yo € Fq. Let y e F' and yo € /q; then 

h(YoYz) h(Yz) h(Yz) 

jyoyi^) jyoiY^)-jyiz) jyi^)' 
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i.e., h(yz)/jy{z) is constant on the coset Toy. Thus we can consider the series 

h{yz) 



yero\r'jy^'^ 

If the series converges absolutely uniformly on compact sets, then the previous argument 
shows that we obtain a holomorphic function / such that / (yz) = jy (z) ■ f(z). 

Apply this with jy{z) — (cz + d)'^^, y = and F a subgroup of finite index in 

As we noted above, Fq is generated by z z + /j for some h, and a typical function 
invariant under z\-^ z + his exTp(2ninz/ h), n = 0,1,2,... 

Definition 4.27 The Poincare series of weight 2k and character n for F is the series 

where F' is the image of F in F(l)/{±I}. 

We need a set of representatives for Fo\F'. Note that 



/I m\ fa b\_fa + mc b + md\ 
\Q \)'\c d)~\ c d )' 



Using this, it is easily checked that ^) and are in the same coset of Fq if and only 

if {c,d) = ±(c\d') and {a,b) = ±{a',b') mod /j. Thus a set of representatives for Fo\F' 
can be obtained by taking one element of F' for each pair {c,d), c > 0, which is the second 
row of a matrix in F'. 

Theorem 4.28 The Poincare series (Pniz) for 2k > 2, n > 0, converges absolutely uni- 
formly on compact subsets of H; it converges absolutely uniformly on every fundamental 
domain D for F, and hence is a modular form of weight 2k for F . Moreover, 

(a) (Pq{z) is zero at all finite cusps, and <po(ioo) = 1; 

(b) for alln > I, (pn (z) is a cusp form. 

Proof. To see convergence, compare the Poincare series with 

T ' 

' * \ffiz-\-n\^^ 
m,neZ,(w,n)^(0,0) ' ' 

which converges uniformly on compact subsets of H when 2k > 2. For the details of the 
proof, which is not difficult, see Gunning 1962, 111.9. □ 

Theorem 4.29 The Poincare series (pn(z), n>\, ofweight2k span Mk{F). 

Before we can prove this, we shall need some preliminaries. 
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The geometry of EI 

As Beltrami and Poincare pointed out, H can serve as a model for non-Euclidean hyperbolic 
plane geometry. 

Recall that the axioms for hyperbolic geometry are the same as for Euclidean geometry, 
except that the axiom of parallels is replaced with the following axiom: suppose we are 
given a straight line and a point in the plane; if the line does not contain the point, then there 
exist at least two lines passing through the point and not intersecting the line. 

The points of our non-Euclidean plane are the points of H. A non-Euchdean "line" is 
a half-circle in H orthogonal to the real axis, or a vertical half-line. The angle between 
two lines is the usual angle. To obtain the distance 8(zi , zj) between two points, draw the 
non-Euclidean line through zi and Z2, let ooi and 002 be the points on the real axis (or ioo) 
on the "line" labeled in such a way that ooi , zi , Z2, 002 follow one another cyclically around 
the circle, and define 

5(zi,Z2) = log Z)(zi,Z2, 001,002) 
where D(z] ,Z2,z-?,Z4) is the cross-ratio (^i~-^3)(^2~-^4) 

V i' ^5 f7 (Z2 — Z3)(Z1— Z4) 

def 

The group PSL2(M) = SL2(M)/{±/} plays the same role as the group of orientation 
preserving affine transformations in the Euclidean plane, namely, it is the group of transfor- 
mations preserving distance and orientation. 

The measure /x(J7) = ffjj plays the same role as the usual measure dxdy on — 
it is invariant under translation by elements of PSL2(M). This follows from the invariance of 
the differential y^-^dxdy. (We prove something more general below.) 

Thus we can consider ffj^ ^ for any fundamental domain D of F — the invariance 
of the differential shows that this doesn't depend on the choice of D. One shows that the 
integral does converge, and in fact that 



dx-dy/y^ = ln{2g-2 + Voc + Y^{l-l/ep)). 



See Shimura, 2.5. (There is a detailed discussion of the geometry of H — equivalently, the 
open unit disk — in C. Siegel, Topics in Complex Functions II, Wiley, 1971, Chapter 3.) 

Petersson inner product 

Let / and g be two modular forms of weight 2/c > for a subgroup F of finite index in 
F{\). 



Lemma 4.30 The differential f(z) ■ g{z) ■ y^^ ^dxdy is invariant under the action of 
SL2(M). (Here z = x + iy, so the notation is mixed.) 

Proof. Let y = ^). Then 

/ (yz) = (cz + d)^^ ■ f (z) (definition of a modular form) 

-2k 



giyz) = (cz + d) ■ g(z) (the conjugate of the definition) 

z + d\ 
dxdy 



-s(y^) = I "^^^l|2 (see p6) 
cz -I- o r 



y (dx ■ dy) 



\cz 
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The last equation follows from the next lemma and the fact (4.8) that dy/dz = 1/ (cz + d)'^. 
On raising the third equation to the {2k — 2) th power, and multiplying, we obtain the result. □ 

Lemma 4.31 For every holomorphic function w(z), the map z w{z) multiplies areas by 

, 2 
aw 



dz 

Proof. Write w{z) = u(x,y) + iv(x,y), z = x + iy. Thus, z \-> vl){z) is the map 

{x,y) ^ {u{x,y),v{x,y)), 

and the Jacobian is 

= UxVy - VxUy. 

Uy Vy 

According to the Cauchy-Riemann equations, w'{z) = Ux + ivx, Ux = Vy, Uy = —Vx, and 
so 

\W'(Z)\^ —ul + vl—UxVy-VxUy. □ 

Lemma 4.32 Let D be a fundamental domain for F . If f or g is a cusp form, then the 
integral 

f(z)-W)-y^''-^dxdy 



' D 

converges. 

Proof. Clearly the integral converges if we exclude a neighbourhood of each of the cusps. 
Near the cusp ioo, f (z) • g(z) = 0{e~'^^) for some c > 0, and so the integral is dominated 
by e^'^y y^^^dy < oo. The other cusps can be handled similarly. □ 

Let / and g be modular forms of weight 2k for some group F C r(l), and assume that 
one at least is a cusp form. The Petersson inner product of / and g is detined to be 



Lemma 4.30 shows that it is independent of the choice of Z). It has the following properties: 
o it is linear in the first variable, and semi-hnear in the second; 



o {f,g) = {gJ); 

o {/,/>>Oforall/^0. 

It is therefore a positive-definite Hermitian form on Sit^iP), and so SkiF) together with 
( , ) is a finite-dimensional Hilbert space. 

Completeness of the Poincare series 

Again let F be a subgroup of finite index in F(l). 

Theorem 4.33 Let f be a cusp form of weight 2k > 2 for F , and let (p„ be the Poincare 
series of weight 2k and character n > 1 for F . Then 
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where h is the width of ioo as a cusp for F and a„ is the nth coefficient in the Fourier 
expansion of f : 

Eoo litiiiz 

Proof. Write (p„ as a sum, and interchange the order of the integral and the sum. Look at a 
typical term. Write it as an integral over a fundamental domain for /q in H, 

{f^(p„)= / f(z)-exp(-27zinz/h)-y^^^''-^'^ -dxdy. 

Jx=0 Jy=0 

Now write / (z) as a sum, and interchange the order of integration and summation. Evaluate. 
See Gunning 1962, III 11, Theorem 5, for the details. □ 

Corollary 4.34 Every cusp form is a linear combination of Poincare series (p„ (z), « > 1 . 

Proof. If / is orthogonal to the subspace generated by the Poincare series, then all the 
coefficients of its Fourier expansion are zero. □ 

Eisenstein series for r{N) 

The Poincare series of weight 2k > 2 and character for F(N) is 

<po{z) = y^ -r (sum over (c,(i) = (0, 1) mod N , gcd(c,d) = i). 

^ {cz + dY'^ 

Recall (4.28) that this is a modular form of weight 2k for F(N) which takes the value 1 at 
/ oo and vanishes at all the other cusps. 

For every complex- valued function v on the (finite) set of inequivalent cusps for F(N), 
we want to construct a modular function / of weight 2k such that /| {cusps} = v. Moreover, 
we would like to choose the /'s to be orthogonal (for the Petersson inner product) to the 
space of cusp forms. To do this, we shall construct a function (restricted Eisenstein series) 
which takes the value 1 at a particular cusp, takes the value at the remaining cusps, and is 
orthogonal to cusp forms. 

Write 7y (z) = l/(cz + d)-^,so that (z) is an automorphy factor: 

;yy'(z) = jy{y'z)-jy'(z). 
Let P be a cusp for F(N), P ^ ioo, and let a e F(l) be such that a(P) = ioo. Define 

<P(^) = ;a(z)^-«9o(o'z). 

Lemma 4.35 The function (p(z) is a modular form of weight 2k for F(N); moreover cp 
takes the value \ at P, and it is zero at every other cusp. 

Proof. Let y ^ ^(-^)- For the first statement, we have to show that (fiyz) = jy{z)~^(p{z). 
From the definition of ^, we find that 

«f(yz) = j^iyzf -(poioyz). 

As F(N) is normal, aya~^ e F(N), and so 

(poicryz) = (po(oya~'^ -az) = j^y^~i {oz)~^ -(poioz). 
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On comparing this formula for ^(yz) with 



jy(z) ''■(p(z) = jyiz) ''■ja(zf-(po(oz), 
we see that it suffices to prove that 

ya(yz)-yffya-i(o'z)~^ = jy(z)~^ -jaiz), 

or that 

Jaiyz) • 7y (z) = jaya-i (or^) • ja{z)- 
But, because of the defining property of automorphy factors, this is just the obvious equaUty 

yay(z) = 7ffya-iff(z)- 
The second statement is a consequence of the definition of (p and the properties of (po-n 

We now compute (p{z). Let T be a set of coset representatives for Pq in r(N). Then 

= joizf ■Y.r^T jrioz)'' 
= JlteT Jroriz) 

Let o = SO that a'^ = (4° "^o), and P = a"! (i) = -6?o/co. Note that 

(c t/) ^ ^^^-^ ^ i^c d) (co t/o) ~ Go do) ^' 

From this, we can deduce that Pa contains exactly one element of P(Ny for each pair 
(c,d) with gcd(c,(3?) = 1 and (c,d) = (co,do). 

Definition 4.36 (a) A restricted Eisenstein series of weight 2k > 2 for P(N) is a 
series 

G(z;co,do;N) = ^(cz + J)-^^ 

(sum over (c,d) = (co,do) mod A'^, gcd(c, J) = 1). Here (co,do) is a pair such that 
gcd(co,io,A^) = 1. 
(b) A general Eisenstein series of weight 2k > 2 for P(N) is a series 

G(z; co,do;N) = ^(cz + d)'^'' 

(sum over (c,d) = (co,t/o) mod A'^, (c,d) ^ (0,0)). Here it is not required that 
gcd(co,i/o,A'^) = 1- 

Consider the restricted Eisenstein series. Clearly, 

G(z;co,do;N) = G(z;ci,di;N) 

if (co,do) = ± (c 1 , Ji ) mod N . On the other hand, we get a restricted Eisenstein series for 
each cusp, and these Eisenstein series are linearly independent. On counting, we see that 
there is exactly one restricted Eisenstein series for each cusp, and so the distinct restricted 
Eisenstein series are Unearly independent. 
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Proposition 4.37 The general Eisenstein series are the linear combinations of the re- 
stricted Eisenstein series. 

Proof. Omitted. □ 

Remark 4.38 (a) Sometimes Eisenstein series are defined to be the linear combinations 
of restricted Eisenstein series. 

(b) The Petersson inner product {/, g) is defined whenever at least one of / or g is a cusp 
form. One finds that {f,g) = (e.g., (po gives the zeroth coefficient) for the restricted 
Eisenstein series, and hence {f.g) = for all cusp forms / and all Eisenstein series 
g: the space of Eisenstein series is the orthogonal complement of Sk(r) in A4i^(r). 

For more details on Eisenstein series for r{N), see Gunning 1962, IV 13. 

Aside 4.39 In the one-dimensional case, compactifying r\M. presents no problem, and the Riemann- 
Roch theorem tells us there are many modular forms. The Poincare series allow us to write down a 
set of modular forms that spans Skir). In the higher dimensional case (see 2.27), it is much more 
difficult to embed the quotient r\D of a bounded symmetric domain in a compact analytic space. 
Here the Poincare series play a much more crucial role. In their famous 1964 paper, Baily and Borel 
showed that the Poincare series can be used to give an embedding of the complex manifold r\D 
into projective space, and that the closure of the image is a projective algebraic variety containing the 
image as a Zariski-open subset. It follows that r\D has a canonical structure of an algebraic variety. 

In the higher-dimensional case, the boundary of r\D, i.e., the complement of r\D in its 
compactification, is more complicated than in the one-dimensional case. It is a union of varieties 
of the form r'\D' with D' a bounded symmetric domain of lower dimension than that of D. The 
Eisenstein series then attaches to a cusp form on D' a modular form on D. (In our case, a cusp form 
on the zero-dimensional boundary is just a complex number.) 
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5 Hecke Operators 

Hecke operators play a fundamental role in the theory of modular forms. After describing 
the problem they were first introduced to solve, we develop the theory of Hecke operators 
for the full modular group, and then for a congruence subgroup of the modular group. 

Introduction 

Recall that the cusp forms of weight 12 for r(l) form a one-dimensional vector space over 
C, generated by Z\ = g| — 27g|, where g2 — 6OG2 and ^3 = I4OG3. In more geometric 
terms, A{z) is the discriminant of the elliptic curve C/Zz + Z. Jacobi showed that 

00 

A{z) = (Iny^-q- n (1 - ^")^'^' 1 = 
n=l 

Write /(z) = q ■ YlT=i(^ — q")'^^ — 'Y^Tin) -q". Then n 1-^ t(«) is the Ramanujan r- 
function. Ramanujan conjectured that it had the following properties: 

(a) \xip)\<2-p^^l^, 

( x{mn) — T(m)r(«) if gcd(m,«) = 1 

I r(p)z{p") — t(/>"+^) + /)^^(/>"~^) if is prime and n > 1. 

Property (b) was proved by Mordell in 1917 in a paper in which he introduced the first 
examples of Hecke operators. To A we can attach a Dirichlet series 

L(A,s) = y^r {n)n~\ 

Proposition 5.1 The Dirichlet series L(A,s) has an Euler product expansion of the form 

L(A,s)= n ) 11-2.) 

if and only if (b) holds. 
Proof. For a prime p, define 

7n>0 

If « e N has the factorization « = H P? ' then the coefficient of n~'' in fl Lp(s) is Y[ ''^{pl' )» 
which the first equation in (b) implies is equal to r(«). Thus 

L{A,s)^WLp{s). 

Now consider 

{\-x{p)p-' + p'^-^')-Lp. 

By inspection, we find that the coefficient of in this product is 

1 for n = 0; 
Ofor« = I; 



x(^pn+^) - ti.p)t{p'') + ;j1^t(;?"-1) for n + 1. 
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Thus the second equation in (b) impUes that (1 — t(p)p ^ + p^^ ^^)- Lp — 1, and 
hence that 

LiA,s) = Y\(l-r(p)p-' + p''-^T'- 

p 

The argument can be run in reverse. □ 
Proposition 5.2 Write 

\-r{p)X + p^^X^ = (l-aX)(l-a'xy, 

Then the following conditions are equivalent: 

(a) \t(p)\<2-p^'/^; 

(b) \a\ = p^^l^ = \a'\; 

(c) a and a' are conjugate complex numbers, i.e., a' = a. 

Proof. First note that x{p) is real (in fact, it is an integer). 

(b) =^ (a): We have x{p) = a + a' , and so (a) follows from the triangle inequality. 

(c) ^ (b): We have that = aa = aa' = p^^ . 

(a) ^ (c): The discriminant of \ — x{p)X + p^^X-^ r(/))^ — 4/7^\ which (a) implies 
is < 0. □ 

For each « > 1, we shall define an operator: 

T{n):Mk{r{\))^Mk{r{\)). 

These operators will have the following properties: 
T{m) o T{n) — T(mn) if gcd(m,rt) = 1; 
T(p)oT(p») = T(p"+^) + p^''-'^T(p»-^), p prime; 

T(n) preserves the space of cusp forms, and is a Hermitian (self-adjoint) operator on 

Skin- 

{T(n)f,g) = {f,T{n)g), f,g cusp forms. 

Lemma 5 . 3 Let V be a finite-dimensional vector space over C with a positive definite 
Hermitian form {,). 

(a) Let a: V bea linear map which is Hermitian (i.e., such that {av, v') = {v,av')); 
then V has a basis consisting of eigenvectors for a (thus a is diagonalizable). 

(b) Let ai,Q!2,... be a sequence of commuting Hermitian operators; then V has a 
basis consisting of vectors that are eigenvectors for all a, (thus the a, are simultaneously 
diagonalizable ). 

Proof, (a) Because C is algebraically closed, a has an eigenvector ei. Let Vi = (C-ei)-^. 
Because a is Hermitian, Vi is stable under a, and so it has an eigenvector 62. Let V2 = 
(Cei + Cei)"*". and continue in this manner. 

(b) From (a) we know that F = V(Xi) where the A, are the distinct eigenvalues for ai 
and V(Xi) is the eigenspace for A;-; thus ai acts as multiplication by A/ on V(Xi). Because 
a2 commutes with ai , it preserves each F(A, ), and we can decompose each F(A, ) further 
into a sum of eigenspaces for a2- Continuing in this fashion, we arrive at a decomposition 
y — such that each a, acts as a scalar on each Vj . Now choose a basis for each Vj 
and take the union. □ 
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Remark 5.4 The pair (F, { , )) is a finite-dimensional Hilbert space. There is an analogous 
statement to the lemma for infinite-dimensional Hilbert spaces (it's called the spectral 
theorem). 

Proposition 5.5 Let f(z) = ^c{n)q" he a modular form of weightlk, k > 0, f ^0. 
If f is an eigenfunction for all T(n), then c(l) 0, and when we normalize f to have 
c(l) = 1, then 

T{n)f = c(n)-f. 

Proof. See later (5.18). □ 

We call a nonzero modular function a normalized eigenform for r(«) if it is an eigen- 
function for T{n) and c(l) = 1. 

Corollary 5.6 If f{z) is a normalized eigenform for all T(n), then c(n) is real for all n. 
Proof. The eigenvalues of a Hermitian operator are real, because 

{av,v) = {Xv,v) =X{v,v), 

= {v,av) = {v,Xv) =X{v,v) 

for any eigenvector I). □ 

We deduce from these statements that if / is a normalized eigenform for all the T(n), 
then 

c(m)c(n) = c(mn) if gcd(m,n) = 1; 
c{p)c{p") = cip"^^) + />^^"^c(/?""^) if p is prime and « > 1. 

Just as in the case of A, this implies 
L{f,s)'^J2'in)-n-' = 

Write\-c{p)X + p^''-^-^'X^ = (l-aX){l -a' X). As in (5.2), the following statements 
are equivalent: 

] 

o \c(p)\ < 2- p^; 

II ^ I /I 
o \a\ = p 2 = \a \; 

o a and a' are complex conjugates. 

These statements are also referred to as the Ramanujan conjecture. As we mentioned earlier 
(p58), they have been proved by Deligne. 

Example 5.7 Because the space of cusp forms of weight 12 is one-dimensional, Z\ is a 
simultaneous eigenform for the Hecke operators, and so Ramanujan's Conjecture (b) for 
r(n) does follow from the existence of Hecke operators with the above properties. 



that 



n 



. (l-c(p)p-' + p^!"-^-^') 

p prime x-r ^ r 
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Note the similarity of L(f,s) to the L-function of an elliptic curve ^/Q, which is 
defined to be ^ 

L(E'')= n ) 1-2.- ri---- 

p good yr r ^ j^^j 

Here 1 — a(p) + p = #E(¥p) for good p. The Riemann hypothesis for E/¥p is that 
IflCi')! < '^■s/P- The number a (/>) can also be realized as the trace of the Frobenius map on a 
certain Q^-vector space ViE. Since rip) is the trace of T(p) acting on an eigenspace, this 
suggests that there should be a relation of the form 

-T(p) = np + np" 

where Up is the Frobenius operator at p. We shall see that there do exist relations of 
this form, and that this is the key to Deligne's proof that the Weil conjectures imply the 
(generalized) Ramanujan conjecture. 

Conjecture 5.8 (Taniyama-Weil) Let E be an elliptic curve over Q. Then L(E,s) = 
L{f,s) for some normalized eigenform of weight 2 for ro{N), where is the conductor of 
E. 

This conjecture is very important. A vague statement of this form was suggested by 
Taniyama in the 50's, was promoted by Shimura in the 60's, and then in 1967 Weil provided 
some rather compelling evidence for it. We shall discuss Weil's work in Section 6. Since 
it is possible to list the normalized eigenforms of weight 2 for PoiN) for a fixed A'^, the 
conjecture predicts how many elliptic curves with conductor there are over Q. Computer 
searches confirmed the number for small A'^, and, as noted in 2.26, the conjecture has been 
proved. 

The conjecture is now subsumed by the Langlands program which (roughly speaking) 
predicts that all Dirichlet series arising from algebraic varieties (more generally, motives) 
occur among those arising from automorphic forms (better, automorphic representations) for 
reductive algebraic groups. 



Abstract Hecke operators 

Let C be the set of full lattices in C. Recall (4.6) that modular forms are related to functions 
on C. We first define operators on C, which define operators on functions on C, and then 
operators on modular forms. 

Let V be the free abelian group generated by the elements of C; thus an element of V is 
a finite sum 

y^«; [Ai] , Hi eZ , Aj e C. 

For n = 1,2, ... we define a Z-linear operator T{n):V -^Vhy setting r(«)[7l] equal to the 
sum of all sublattices of A of index n : 

T{n)[A] = J2 [^']- 

A such that (A:A')=n 

The sum is obviously finite because any such sublattice A' contains nA, and A/nAis finite. 
Write R(n) for the operator 

R(n)[A] = [nA]. 
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Proposition 5.9 (a) If m and n are relatively prime, then 

T(m) o T(n) = T{mn). 

(b) If p is prime and n>\, then 

T{p") o Tip) = Tip"+') + pRip) o Tip"-'). 

Proof, (a) Note that 

Timn)[A] = J^IA"] (sum over A" with iA : A") = mn); 
Tim) o Tin)[A] = J^^A"] (sum over pairs (A', A") with iA:A') = n, iA' : A") = m). 

But, if A" is a sublattice of A of index mn, then there is a unique chain 

A^A'^ A" 

with A' of index n in A, because A/mnA is the direct sum of a group of order m and a 
group of order n . 

(b) Let y\ be a lattice. Note that 

Tip") o Tip)[A] = J2[A"] (sum over iA', A") with iA : A') = p, iA' : A") = p") 
Tip"+'')[A] = J^^A"] (sum over A with iA : A") = p"+''); 
pRip) o Tip"-'')[A] = P-J2 ^(P)i^'] (sum over A' c A with iA : A') = p"-'). 
Hence 

pRip) o Tip"-')[A] = • J2^A"] (sum over yl" c pA with (/jA : yl") = p"-'). 

Each of these is a sum of sublattices A" of index p"'^' in A. Fix such a lattice, and let 
a be the number of times it occurs in the first sum, and b the number of times it occurs in the 
last sum. It occurs exactly once in the second sum, and so we have to prove: 

(2 = 1+ pb. 

There are two cases to consider. 

The lattice A" is not contained in pA. Then b = 0, and a is the number of 
lattices A' containing A" and of index p in A. Such a lattice contains pA, and its image 
in A/ pA is of order p and contains the image of A", which is also of order p. Since the 
subgroups of A of index /? are in one-to-one correspondence with the subgroups of A/ pA 
of index p, this shows that there is exactly one lattice A', namely A + pA", and so a = 1. 

The lattice A" c pA. Here b = I. Every lattice A' of index p contains pA, and a 
fortiori A. We have to count the number of subgroups of A/ pA of index p, and this is the 
number of lines through the origin in the F^-plane, which is (p^ — !)/(/» — l) = /7+l. □ 

Corollary 5.10 For any m and n , 

Tim) o Tin) = ^ d ■ Rid)oTimn/d^) 

d\gcd(m,n),d>0 
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Proof. Prove by induction on s that 

T{p'-)oT(p')= J2 P'-Rip')°T{p'-+'-^'), 

i <min(r, j) 

and then apply (a) of the proposition. □ 

Corollary 5.11 LetV.be the Z-subalgebra of End(P) generated by the T{p) and R{p) 
for p prime; then T-L is commutative, and it contains T(n) for all n . 

Proof. Obvious from 5.10. □ 
Let F be a function C ^ C We can extend F by linearity to a function F:V ^C, 

F(E«,[A]) = E«/^(A). 
For any operator T on V, we define T ■ F tobe the function £ ^ C such that 

(T-F)([A]) = F(T[A]). 

For example, 

(T(n) ■ F)([A]) = ^ F([A']) (sum over sublattices A' of A of index n) 

and if F is of weig ht 2k, i.e., F(XA) = X-^^F(A) for all lattices A, then 

R{n)-F = n-^^'-F. 

Proposition 5.12 Let F:C^C be a function of weight 2k. Then T{n) ■ F is again of 
weight 2k, and for any m and n , 

T(m)-T(n)-F = d^~^'' ■ T {mn / d^) ■ F . 

d\gcd{m,n) , d>0 

In particular, if m and n are relatively prime, then 

T(m) ■ T(n) ■ F = T(mn) ■ F, 

and if p is prime and n>\, then 

Tip) ■ T{p") ■ F = T(p"+^) ■ F + p^-^'' ■ T(p"-^) ■ F. 
Proof. Immediate from Corollary 5.10 and the definitions. □ 

Lemmas on 2 x 2 matrices 

Before defining the action of Hecke operators on modular forms, we review some elementary 
results concerning 2x2 matrices with integer coefficients. 

Lemma 5.13 Let A be a 2x2 matrix with coefficients in Z and determinant n . Then there 
is an invertible matrix U in M2(Z) such that ■ ^ = (o rf) ^^^^ 

ad = n, a > 1,0 < b < d. (6) 

Moreover, the integers a, b, d are uniquely determined. 
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Proof. It is possible to put A into upper triangular form by using elementary operations of 
the following types: add a multiple of one row to a second; swap two rows (see ANT 2.44). 
Since these operations are invertible over Z, they amount to multiplying A on the left by an 
invertible matrix in M2(Z). For the uniqueness, note that a is the gcd of the elements in 
the first column of A, d is the unique positive integer such that ad — n, and b is obviously 
uniquely determined modulo J. □ 

Remark 5.14 Let M(n) be the set of 2 x 2 matrices with coefficients in Z and determinant 
n. The group SL2(Z) acts on M(n) by left multiplication, and the lemma provides us with a 
canonical set of representatives for the orbits: 

M{n) = SL2(Z) • (q ^ ) (disjoint union over a, ^, d as in the lemma). 

Now let yl be a lattice in C. Choose a basis coi , 0)2 for A, so that A = A(a)i , 0)2)- For any 
a = ^) e M{n), define aA = A{ao)\ + ha)2.C(x)i + da)2)- Then aA is a sublattice of A 
of index n, and every such lattice is of this form for some a e M(n). Clearly a A = fiA if 
and only if fi = ua for u e SL2(Z). Thus we see that the sublattices of A of index n are 
precisely the lattices 

A{aa)i+hu)2.d(i)2), a,b,d eZ, ad = n, a > 1,0 < b < d.. 

For example, consider the case n = p. Then the sublattices of A are in one-to-one correspon- 
dence with the lines through the origin in the 2-dimensional -vector space A/ pA. Write 
A/ pA = ¥pei®¥pe2 with = coj (mod p) . The lines through the origin are determined 
by their intersections (if any) with the vertical line through (1,0). Therefore there are p + 1 
lines through the origin, namely, 

¥p-ei, ¥p- (61+62), ¥p-(6i + (p-\)e2), ¥p(62). 

Hence there are exactly p + I sublattices of yl = Zwi -I- IjC02 of index p, namely, 

A(cOi,p(02), A(0)i + 0)2, pa)2), A(pO)i,0)2), 

in agreement with the general result. 

Remark 5.15 Let a e M(n), and let A' = a A. According to a standard theorem, we can 
choose bases coi, CO2 for A and a)[, for A' such that 

cjo[ — aa)i, = dQ)2, a, d e Z, ad = n, a\d,a>\ 

and a, 6? are uniquely determined. In terms of matrices, this says that 

M(«) = ySL2(Z).(^JJ rf)-SL2(Z) 

— disjoint union over a , sZ^ad = n,a\d,a>\. This decomposition of M(n) into a union 
of double cosets can also be proved directly by applying both row and column operations of 
the type considered in the proof of (5.13) to put the matrix ^) in the form (g 
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Hecke operators for r(l) 

Recall 4.6 that we have a one-to-one correspondence between functions F on £ of weight 
Ik and functions / on IHI that are weakly modular of weight 2k, under which 

F{A{o)i,Q}2)) = a)2'^^ ■ f{oyi/o)2), 
f{z) = F(A(z, 1)). 

Let / (z) be a modular form of weight 2k, and let F be the associated function of weight 2k 
on C. We define T(n) ■ f (z) to be the function on IHI associated with n^^~^ ■ T{n) ■ F . The 
factor n^^~^ is inserted so that some formulas have integer coefficients rather than rational 
coefficients. Thus 

T{n)-f{z) = n^^-^-{T{n)-F){A{z,\)). 

More explicitly, 

r(n)./(z) = «2^-i.^^-2V(^) 
where the sum is over the triples a,b, d satisfying (5.13(6)). 

Proposition 5.16 (a)If f isa weakly modular form of weight 2k forF(l), then T(n)-f 
is also weakly modular of weight 2k, and 

i) T{m)-T{n)- f = T{mn)- f ifm andn are relatively prime; 

ii) T(p) ■ Tip'') ■ f = T(p''+^) ■ f + • Tip"-^) ■ f if p is prime andn > I. 

(b) Let f be a modular form of weight 2k for F(l), with the Fourier expansion f = 
^m>oC{m)q'" , q = e'^^'^ . ThenT(n)-f is also a modular form, and 

T{n)-f{z)=Y,Yijn)-q^ 

with 

Y(m)= J2 a^'-'-cC^)- 

a\gcd{m,n), a>l 

Proof, (a) We know that 

T(p) ■ Tip") ■ F(Aiz, 1)) = T{p"+') ■ F(A(z, 1)) + p''^'' ■ Tip"'') ■ F{A{z, 1)). 

On multiplying through by (^«+i)2fc-i obtain the second equation. The first is obvious, 
(b) We know that 

a,b,d 

where the sum over a,b,d satisfying (6), i.e., such that 

ad=n, a>\, 0<b<d. 
Therefore T(n) ■ f(z) is holomorphic on H because / is. Moreover 
T(n)-f(z) = n^^-' J2 

a,b,d m>0 
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But 

^27ti^ _ t d. if d\m 

0<b<d 

Set m/d = m'; then 



otherwise. 



T(n) ■ f{z) = n^^-i J2 d-^^^'^ c{m'd)q'"^' 

a,d,m' 

where the sum is over the integers a,d,m' such that ad =n and a > 1. The coefficient of 
in this is 

a|gcd(n,f),a>l 

When we substitute m for t in this formula, we obtain the required formula. Because 
y(m) = for m < 0, T{n) • f is holomorphic at z oo. □ 

Corollary 5.17 Retain the notations of the proposition. 

(a) The coefficients y(0) = cr2k-i(n)-c(0), y(l) = c(m). 

(b) Ifn = p is prime, then 

i) Y(m) = c(pm) if p does not divide m; 

ii) y(m) = c(pm) + p^^~^c{m/ p) if p\m. 

(c) If / is a cusp form, then so also is T(n) • f. 

Proof. These are all obvious consequences of the proposition. □ 

Thus the r(n)'s act on the vector spaces Mkif^i^)) and Sk(r{l)), and satisfy the 
identities 

T(m) o T(n) = T(mn) if m and n relatively prime; 
T(p) o T{p") = Tip"+^) + ^j^^-i ■ T{p"-^) if p is prime n>\. 

Proposition 5.18 Let f = J2c(n)q" be a nonzero modular form ofweightlk. Assume 
that f is a simultaneous eigenform for all the T(n), say, 

T{n)-f = X{n)-f, A(«)eC. 

Then c(l) / 0, and if f is normalized to make c(l) = 1, then 

c{n) = X{n) 

for alln> 1 . 

Proof. We have seen that the coefficient of q in T{n) ■ f is c{n). But, it is also X{n) • c(l), 
and so c{n) = X{n) 0(1). If c(l) were zero, then all c{n) would be zero, and / would be 
constant, which is impossible. □ 

Corollary 5.19 Two normaUzed eigenforms of the same weight with the same eigenval- 
ues are equal. 

Proof. The proposition implies that the coefficients of their Fourier expansions are equal. □ 
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Corollary 5.20 If f = '^c(n)q" is a normalized eigenform for the T{n), then 
c(m) -cin) = c(mn) ifm and n are relatively prime, 
c(p)-c{p") = c(p"~^^) + p^^~^c{p"^^) if p is prime andn > 1. 

Proof. We know that these relations hold for the eigenvalues. □ 
With a modular form /, we can associate a Dirichlet series 

L(/,5) = ^c(n)-n-^ 

The series '^n~^ converges for > 1. The bounds on the values \c(n) \ (see p58) show 
that L(f,s) converges to the right of some vertical line (if one accepts Deligne's theorem 
and / is a cusp form of weight 2k, it converges for — /c + |) > 1, i.e., for s > k + 

Proposition 5.21 For every normaUzed eigenform f, 

L{f,s) - n ^ _ ^(^^^^-5 + p2k-i-2s ■ 
Proof. This follows from 5.20, as in the proof of (5. 1). □ 

The Hecke operators for r(l) are Hermitian 

Before proving this, we make a small excursion. 

Write GL2(M)"'" for the group of real 2x2 matrices with positive determinant. Let 



6GL2(M)" 



and let / be a function on H; we define 

f\ka = (detaf ■ (cz + dy^^ ■ f{^). 

For example, if a = (q then f\kOi = a^^ ■ a~^'^ ■ f(z) = f{z); i.e., the centre of 
GL2(M)+ acts trivially. Note that / is weakly modular of weight 2k for F C F{1) if 
and only if f\kOi = f for all a e F. 
Recall that 



T(n).f(z) = n^'-'.J2d-''-n^-^) 



d 

— sum over a,b,d , ad = n, a > I, < b < d. We can restate this as 



T(n)-f = J2f^^~'-f\kO' 



where the of's run through a particular set of representatives for the orbits of F(l) acting on 
M(n), i.e., for F(\)\M(n). It is clear from the above remarks, that the right hand side is 
independent of the choice of the set of representatives. 

Recall, that the Petersson inner product of two cusp forms / and g for F(l) is 

{f,8) = jjj-g-y^^-^-dxdy 

where z = x + iy and D is any fundamental domain for 7^(1). 
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Lemma 5.22 For every a e GL2(M)+ , 

{f\ka,g\kOi) = {f,g). 
Proof. Write co{f,g) = f (z)g{z)y^^^dxdy , where z = x + iy. We first prove that 

(oif\kOi,g\k(x) = a*(oif,g). (7) 

Since multiplying o; by a scalar changes neither co(f\j^a,g\i^a) nor a*(ii(f,g), we can 
assume that deta = 1. Then 

f\ka = (cz + dr^''-f(az) 

^-2k 



g\ka = (cz + d) "-"-giaz) 
and so 



(^{f\k(^^g\k'^) = \cz + d\ ■ f(az)-g(az)-dx-dy. 
On the other hand (see the proof of 4.30) 

S(o;z) S(z)/|cz + 
a*(dx-dy) = dx-dy /\cz + d\^, 

and so 



a* g)) = f{az) ■ giaz) ■\cz + d\'^-'^^ ■ j^^"^ ■\cz + d\-^ ■ dx- dy 
^a){f\ka,g\ka). 

From (7), 

// O)if\k0i,g\k0i) = a*Q)(f,g)= o)(f,g), 
JJD JJD JjaD 

which equals (/, g) if aD is also a fundamental domain for F(l). Unfortunately, this is not 
always true. Instead one computes the Petersson scalar product with respect to a sufficiently 
small congruence subgroup F such that aFa^^ c F(l) (and normalizes by the quotient 
of the volumes of the fundamental domains to get the wanted scalar product with respect 
to F(l)). If D now denotes a fundamental domain for F, then aD is a fundamental domain 
for aFa~^, and obviously it has the same volume as D, and by the choice of F, both of / 
and g are still modular with respect to aFa~^. □ 

Note that the lemma implies that 

(/U«,^> = {f,g\kc^~'), alia eGL2(M)+. 

Theorem 5.23 For cusp forms f, g of weight 2k 

{T(n)f,g) = {f,T(n)g), alln. 

Because of (5.10), it suffices to prove the theorem for T(p), p prime. Recall that M(n) 
is the set of integer matrices with determinant n . 



Lemma 5.24 There exists a common set of representatives {a, } for the set of left orbits 
F(l)\M(p) and for the set of right orbits M(p)/F(l). 
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Proof. Let a, fi e M(p); then (see 5.15) 

r(i)-a-r(i) = r(i)-(i «)-r(i) = r(i)-;6-r(i). 

Hence there exist elements u,v,u' ,v' e ^(1) such that 

uav = u'fiv' 

and so u'-'^ua = Pv'v~\ = y say. Then r(l)-a = r(l) • y and ^ • r(l) = y -TCI). □ 

For o; = ^) 6 M(p), set a' = = p-a~^ e M(p). Let a,- be a set of common 

representatives for r(l)\M(p) and M(p)/ r(l), so that 

M(j9) = y r(l) - a, = y a, • r(l) (disjoint unions). 



Then 

Therefore, 



M{p) = p.M{pr' = [jp-r(i)-ar' = y r(l)-a;. 

{T(p)f,g)^p''-'Yl(f\k^i'S) 
i 

^{f,T(p)g). 

The Z-structure on the space of modular forms for r(l) 
Recall (4.20) that the Eisenstein series 



^ , (mz + 

(m,n)^(0,0) ^ ' 

= 2^(2^) + 2l^^^a2,-i(«)^", q = e^-'-^ 

n = l 

For k >l, define the normalized Eisenstein series 

Ek(z) = Gk(z)/2i;{2k). 



Then, using that l^(2k) = ^yj^Bj^n , one finds that 

oo 

Ek{z)=\ + YkY.''^k-i{n)q\ Yk = {-if ■ 



n = l 



€ 

Bk 
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For example, 



£2(2) = 1+240 

n = l 

00 

Es (z) = 1 - 504 J2 ^5 , 



n=l 



54600 °° 



^6(z) = l + -^l]aii(nV. 

n=l 

Note that £'2(2) and £'3(2) have integer coefficients. 

Lemma 5.25 The Eisenstein series G^, k >2, is an eigenform of the T(n), with eigenval- 
ues a2k-iin). The normalized eigenform is y^"^ • E^. The corresponding Dirichlet series 
is 

^(s).^(s-2k + l). 

Proof. The short proof that Gy^ is an eigenform, is to observe that Alyt = 5;^ ® (G^/t)' and 
that T(n) ■ is orthogonal to (because is, T{n) is Hermitian, and T(n) preserves 
Sk). Therefore T(n) ■ G;t is a multiple of G^. 

The computational proof starts from the definition 

Therefore 

Tip)-G,iA) = Y: E ^ 
A' XeA',XjtO 

where the outer sum is over the lattices A' of index p in A. If X € pA, it lies in all A', and 
so contributes (p + l)/X^^ to the sum. Otherwise, it lies in only one lattice A', namely 
pA + ZA, and so it contributes 1 /A^*^. Hence 

iTip)-GkKA) = Gk{A) + p J2 = GkiA) + p'-^''GkiA) = (1 + p^'^'^JGkiA). 

XepA,X^O 

Therefore Gk{A), as a function on C, is an eigenform of T(p), with eigenvalue 1 + p^~-^^ . 
As a function on H it is an eigenform with eigenvalue p^^~^{\ + p^~-^^) — p-^^~^ + 1 = 

The normalized eigenform is 

00 

n=l ^ 
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and the associated Dirichlet series is 

,2yt-l 



oo , . 


= E 


n = l 


a,d>l 




-(^ 




\d>l 



= ^{s)-^(s-2k + l). □ 

Let F be a vector space over C. By a Z-structure on V, I mean a Z-module Vq C V 
which is free of rank equal to the dimension of V. Equivalently, it is a Z-submodule 
that is freely generated by a C-basis for V, or a Z-submodule such that the natural map 
Vo <8>z C ^ K is an isomorphism. 

Let M-k{1j) be the Z-submodule of consisting of modular forms / — 

YlT=o'^nq'^ with the <3„ € Z. 

Proposition 5.26 The module Mk{Z) is aZ-structure on Mk{r{l)). 

Proof. Recall that ^j^MkiC) = C[G2, G3] = C[E2, E3]. It suffices to show that 

Note that £2(2), E^iz), and A' = q ]~[(1 — q")'^^ all have integer coefficients. We prove 
by induction on k that TWy^C^) is the 2/cth-graded piece of Z[£'2. ^3] (here E2 has degree 4 
and £3 has degree 6). Given / (z) = '^Onq", a„ e Z, write 

with 4a -|- 6^ = 2k, and g e 7Wyt-i2- Then ao e Z, and one checks by explicit calculation 
thatgeX^_i2(Z). □ 

Proposition 5 .27 The eigenvalues of the Hecke operators are algebraic integers. 

Proof. Let Aik(Z) be the Z-module of modular forms with integer Fourier coefficients. It 
is stabilized by T{n), because. 

r(«)./(z)= ^y(m)-^- 

with 



y(m) = y^a^^ ^ •c( — r-) (sum over a\m ,a > 1). 



The matrix of T(n) with respect to a basis for A^;t(Z) has integer coefficients, and this 
shows that the eigenvalues of are algebraic integers. □ 



Aside 5.28 The generalization of (5.27) to Siegel modular forms of all levels was only proved in 
the 1980s (by Chai and Fallings), using difficult algebraic geometry. See Section 7. 
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Geometric interpretation of Hecke operators 

Before discussing Hecke operators for a general group, we explain the geometric significance 
of Hecke operators. Fix a subgroup F of finite index in r{\). 

Let a e GL2(M)''^. Then a defines a map x ax:H — > H, and we would hke to define 
a map q;:F\]HI r\E., fz i-> "aPz". Unfortunately, F is far from being normal in 
GL2(M)"'". If we try defining ai^Fz) = Faz we run into the problem that the orbit Faz 
depends on the choice of z (because a~^Fa ^ F in general, even if a has integer coefficients 
and r = F(N)). 

In fact, aFz is not even a F-orbit. Instead, we need to consider the union of the orbits 
meeting aFz, i.e., we need to look at FaFz. Every coset (right or left) of F in GL2(M)"'" 
that meets FaF is contained in it, and so we can write 



and then FaFz = [jFatz (disjoint union). Thus a, or better, the double coset FaF, 
defines a "many-valued map" 



Since "many-valued maps" don't exist in my lexicon, we shall have to see how to write this 
in terms of honest maps. First we give a condition on a that ensures that the "map" is at least 
finitely-valued. 

Lemma 5.29 Leta e GL2(M)+. Then FaF isafinite union of right (and of left) cosetsif 
and only if a is a scalar multiple of a matrix with integer coefficients. 

Proof. Omit. [Note that the next lemma shows that this is equivalent to a~^Fa being 
commensurable with F.] □ 



with ai = a- fit . 

Proof. We are given that F = \J(F na~^ Fa)^i. Therefore 

a-^FaF = [Ja~^ Fa-iF na~^ Fa)- = [J(a~^ FaF na~^ Fa)- ^i. 



But a FaF a Fa, and so we can drop it from the right hand term. Therefore 




r\EI^r\EI, Fz^{Faiz}. 



Lemma 5.30 Leta e GL2(M)+. Write 



F = \J{F r\a ^Fa)Pi (disjoint union); 



then 




a-^FaF = [ja-^Fafii. 



On multiplying by a, we find that FaF = [J^ Faf^i, as claimed. 

If Fa^i — Fa^j, then e a~^Fa; since it also lies in F, this implies that 

i = j. □ 
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Now let To; = r n a~ 



^ Fa, and write = U • (disjoint union). Consider 



r«\H 





r\M. 



The map a sends an orbit Fa • x to F • ax — this is now well-defined — and the left hand arrow 
sends an orbit • x to F • x. 

Let / be a modular function, regarded as a function on r\M. Then / o a is a function 
on ro;\BI, and its "trace" J2f is invariant under F, and is therefore a function on 

F\M.. This function is ^/ oa, = T(p)- f . Similarly, a (meromorphic) modular form 
can be thought of as a k-i6\A differential form on F\M., and T{p) can be interpreted as the 
pull-back followed by the trace in the above diagram. 

Remark 5.31 In general a diagram of finite- to-one maps 



is called a correspondence on X x Z. The simplest example is obtained by taking Y to be 
the graph of a map (p:X ^ Z; then the projection F ^ X is a bijection. A correspondence 
is a "many-valued mapping", correctly interpreted: an element x e X is "mapped" to the 
images in Z of its inverse images in Y . The above observation shows the Hecke operator 
on modular functions and forms is defined by a correspondence, which we call the Hecke 
correspondence. 

The Hecke algebra 

The above discussion suggests that we should define an action of double cosets FaF on 
modular forms. It is convenient first to define an abstract algebra, T-L{F, A), called the Hecke 
algebra. 

Let r be a subgroup of F{\) of finite index, and let Ahea. set of real matrices with 
positive determinant, closed under multiplication, and such that for a e ^, the double coset 
FaF contains only finitely many left and right cosets for F . Define ^{{F, A) to be the free 
Z-module generated by the double cosets FaF, a ^ A. Thus an element of T-L{F, A) is a 
finite sum. 



Write [a] for FaF when it is regarded as an element of ^.{F, A). 

We define a multiplication on T-L^F, A) as follows. Note that if FaF meets a right coset 
Fa' , then it contains it. Therefore, we can write FaF = \jFai, F^F = [jF^j (finite 
disjoint unions). Then 



Y 




X 



Z. 




FaF-F^F = FaFpF 



= [jFaFPj 
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therefore FaF^r is a finite union of double cosets. Define 

where the union is over the y s A such that FyF C FaF^F , and ^ is the number of 
pairs with Fajfij = Fy- 

Example 5.32 Let F = F(l), and let A be the set of matrices with integer coefficients 
and positive determinant. Then HiF, A) is the free abelian group on the generators 

F(l)^ ^^r(l), a\d, ad>0, a > 1, a,d sZ. 

Write T{a,d) for the element F(l){'^ o{n(F,A). Thus n(F,A) has a quite 

explicit set of free generators, and it is possible to write down (complicated) formulas for the 
multiplication. 

For a prime p, we define T(p) to be the element T(l, p) of HiF, A). We would like to 
define 

def 

T{n) = M{n) = {matrices with integer coefficients and determinant n}. 

We can't do this because M{n) is not a double coset, but it is a finite union of double cosets 
(see 5.15), namely, 

M(n)^[jF{\)-{^^ ^^•r(l), a\d, ad^n, a>l, a,deZ. 

This suggests defining 

T{n) = ^^T{a,d), a\d, ad=n, a > 1, a,d eZ. 

As before, we let V be the free abelian group on the set of lattices £ in C. A double 
coset [a] acts on V according to the rule: 

[a]- A = aA. 

To compute a A, choose a basis (^2) for yl, and let a A be the lattice with basis a • 
this is independent of the choice of the basis, and of the choice of a representative for 
the double coset FaF.) We extend this by linearity to an action of TiiF, A) on V. It is 
immediate from the various definitions that T(n) (element of TiiF, A)) acts on V as the 
T(n) defined at the start of this section. The relation in (5.10) implies that the following 
relation holds in the ring HiF, A): 

T(n)T(m)= d ■T(d,d)-T(nm/d^) (8) 

d\gcd{m,n) 

In particular, for relatively prime integers m and n , 

T(n)T(m) = T(nm), 
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and for a prime p. 

Tip) ■ Tip'') = T{p"+') + p ■ Tip, p) ■ Tip"-'). 
The ring HiF, A) acts on the set of functions on £: 

[a]-F = Y^ Fiat A) if FaF = [J . Fai . 
The relation (8) implies that 

Tin)- Tim)- F= d'~^^ -Timn/d^)- F 

rf|gcd(m,«) 

for F a function on £ of weight 2k. 

Finally, we make HiF, A) act on MkiFil)) by 

[a]-/ = det(a)^-i.^/Ua,- (9) 

if FaF = \jFii)- ai . Recall that 

/Ua = (deta)^-(cz + ^/)2^./(^) 

if a = The element Tin) e HiF, A) acts on MkiFil)) as in the old definition, and 

(8) implies that 

Tin)-Tim)-f= d^^'^ -Timn/d^)- f. 

d\gcd{m,n) 

We now define a Hecke algebra for FiN). For this we take Z\(A'^) to be the set of integer 
matrices a such that n = det(a) is positive and prime to A^, and a = ( ^ mod A^. 

Lemma 5.33 Let A'iN) be the set of integer matrices with positive determinant prime to 
N . Then the map 

FiN) - a - FiN) ^ r(l) • a - Fiiy.niFiN), AiN)) UiFil), A'iN)) 
is an isomorphism. 

Proof. Elementary. (See Ogg 1969, pIV- 10.) □ 

Let T^ia,d) and T^in) be the elements of TiiFiN), AiN)) corresponding to Tia,d) 
and Tin) in TiiFil), A'iN)) under the isomorphism in the lemma. Note that TiiF il) , A' iN)) 
is a subring of TiiFil), A). From the identity (8), we obtain the identity 

T^in)T^im)= ^ d -T^ id,d)-T^ imn/d^) (10) 

d\gcd{n,m) 

for imn,N) = I. 

When we let niFiN),AiN)) act on MkiFiN)) by the rule (9), the identity (8) 
translates into a slightly different identity for operators on M.kiFiN)). (The key point 
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is that 0) e A'{N) if gcd(c?,A^) = 1 but not Z\(A^)— see Ogg 1969, pIV-12). For 
/ ^ Mk{r{N)),vjQ have the identity 

T^(n)-T^{m)-f = d^'"'^ ■ -T^ (mn/d^)- f 

d \ gcd(m,n) 

for (mn,N) = 1. Here is a matrix in r(l) such that R^ = mod A'^. 

The term R^ causes problems. Let V = Mk(r(N)). If <i = 1 mod A^, then R^ e r(N), 
and so it acts as the identity map on V. Therefore d R^ defines an action of (Z/A^Z)^ 
on V, and so we can decompose V into a direct sum, 

over the characters s of (X/ NZ)^ , where 

Vie) = {feV\f\kRci=s{d)-f}. 

Lemma 5.34 The operators R„ and (m) on V commute for (nm, N) = \ . Hence V(e) 
is invariant under (m). 

Proof. See Ogg 1969, pIV-13. □ 
Let Mk(r(N),6) = V(s). Then T^(n) acts on X^(r(A^),e) with the basic identity: 
T^{n)-T^{m)= ^ d^'"'^ -sid)-!^ {nm/d^), 

<i|gcd(n,/?3) 

for (nm, N) = 1. 

Proposition 5.35 Let f e Mk(r(N),e) tiave the Fourier expansion f = Y.anq" ■ As- 
sume that f is an eigenform for all («), and normalize it so that a\ = I. Then 

Proof. Essentially the same as the proof of Proposition 5.1. □ 

Let U = (o i) (it would be too confusing to continue denoting it as T). Then € 
r(N), and so 

f^f\kU"' = f{z + m) 

def 

defines an action of Z/NZ on F = M.ic(r(N)). We can decompose V into a direct sum 
over the characters of Ia/NIj. But the characters of Ij/NIj are parametrized by the A^th 
roots of one in C — the character corresponding to ^ is m mod N t,'^. Thus 

F = 0^F(O, ^anA^throotof 1, 

where K(0 = {f e V \ U"' ■ f = I;"" f}. Alas K(0 is not invariant under {n). To 
remedy this, we have to consider, for each t\N, 

7(0 = V{i;) , i; a primitive {N / t)th. root of 1 . 
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Let m be an integer divisible only by the primes dividing N ; we define 

Q<b<m 

For a general « > 1, we write n = mno with gcd(no> N) = 1, and set 

T\n) = T(no)-T'(m). 

We then have the relation: 

T\n)-T'{m)-f = <d)d^^-^T* {nm/d'^)- f 

<i|gcd(n,m) 

for f eV{e,t) = V{s)r\V{t). 

Theorem 5.36 Let / e V{t,e) have the Fourier expansion f{z) = J^^n^l"^- If ai — I 
and f is an eigenform for all the T*(n) with gcd(n,N) = 1, then the associated Dirichlet 
series has the Euler product expansion 

J^anU - n 1 _ + p2k-\-2s ■ 

Proof. See Ogg 1969, pIV-10. □ 

In the statement of the theorem, we have extended e from (Z/A^Z)^ to Z/A'^Z by setting 
s{p) — for p\N . Thus s{p) = if p\N , and a^, = if p\y- This should be compared 
with the L-series of an elMptic curve E with conductor N, where the /?-factor of the L-series 
is (1 lb p~^)~^ if p\N but p'^ does not divide N, and is 1 if p'^\N. 

Proposition 5.37 Let f and g be cusp forms for r(N) of weight 2k and character s. 
Then 

{T{n)-f,g)=e{n)U,T(n)-g) 
Proof. See Ogg 1969, pIV-24. □ 

Unlike the case of forms for F(l), this does not imply that the eigenvalues are real. It 
does imply that Mk{r{N),s,t) has a basis of eigenforms for the T{n) with gcd(n,N) = 1 
(but not for all r(n)'s). 

For a summary of the theory of Hecke operators for ro(N), see Milne 2006, V 4. 
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The Algebro-Geometric Theory 



In this part we apply the preceding theory, first to obtain elliptic modular curves defined over 
number fields, and then to study the zeta functions of modular curves and of elliptic curves. 
There is considerable overlap between this part and my book on elliptic curves. 

6 The Modular Equation for FoiN) 

For every congruence subgroup F of r(l), the algebraic curve /^\]HI* is defined over a 
specific number field. As a first step toward proving this general statement, we find in 
this section a canonical polynomial F(X, Y) with coefficients in Q such that the curve 
F(X, 7) = is birationally equivalent to Xo{N) = ro(A^)\H*. 
Recall that 

{V ")(: ^)(o - (: 



c = mod 



and so 



Fo(N) = F(l)nY-'F(l)Y. 
Note that -/ e ro(A^). In the map 

SL2(Z)^ SL2(Z/NZ) 

the image of Fq{N) is the group of all matrices of the form ^ in SL2(Z/A^Z). This 

group obviously has order A'^ ■ (p(N), and so (cf. 2.23), 

IX = (f(l) : Fo(N)) = (r(l) : FoiN)) = • [] /'l + 1 V 

p\N^ 

(Henceforth, F denotes the image of F in SL2(Z)/{ib/}.) Consider the set of pairs {c,d) 
of positive integers satisfying: 

gcd(c,J) = l, d\N, 0<c<N/d. (11) 
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For each such pah; we choose a pair {a,b) of integers such that ad — be = 1. Then the 
matrices (" ^) form a set of representatives for ro(N)\r(l). (Check that they are not 
equivalent under left multiplication by elements of ro(N), and that there is the correct 
number.) 

If 4| A'^, then /o(A^) contains no elliptic elements of order 2, and if 9\N then it contains 
no elliptic elements of order 3. The cusps for ro(N) are represented by the pairs (c,d) 
satisfying (1 1), modulo the equivalence relation: 

(c,d) ~ (c' , d') li d = d' and c' = c + m , some m e Z. 

For each d, there are exactly (p{gc&{d, N/d)) inequivalent pairs, and so the number of cusps 
is 

J2 <p(gcd(j,f)). 

d\N,d>0 

It is now possible to use Theorem 2.22 to compute the genus of Xo(N). (See Shimura 1971, 
p25, for more details on the above material.) 

Theorem 6. 1 The field C(Xo(N)) of modular functions for FoiN) is generated (over C) 
by j(z) and j(Nz). The minimum polynomial F{j,Y) e C(j)[F] of j(Nz) overC(j) 
has degree ji. Moreover, F(j,Y) is a polynomial in j and has coefficients in Z, i.e., 
F(X, Y) e Z[X, Y]. When N > I, F{X, Y) is symmetric in X and Y, and when N = p is 
prime, 

F(X,Y) = XP+'^ + YP+^ -X^YP -XY mod p. 
Proof. Let y = ^) be an element of Fq(N) with c = Nc', c' e Z. Then 

^ .(Naz + Nh\ .(Naz + Nb\ .fa{Nz) + Nb\ .^^^ ^ 
^^^^"^ = J i^TT^ j = ' [-N?^ )='[ c'(Nz) + d )= ^^^^^ 

because e ^(1)- Therefore C(j(z),j(Nz)) is contained in the field of modular 

functions for Fo(N). 

The curve Xo(N) is a covering of X(l) of degree /x = (F(l) : Fo{N)). From Proposition 
1.16 we know that the field of meromorphic functions C(Xo{N)) on Xo(N) has degree 
jx over C(X(l)) — C(j), but we shall prove this again. Let {yi — l,...,Yfi} be a set of 
representatives for the right cosets of /o(A^) in F(l), so that, 

F(\) = [jFo(N)Yi (disjoint union). 

For every y e F(l), {yiy, y/^y} is also a set of representatives for the right cosets of 
Fo(N) in r(l) — the set {Fo(N)YiY} is just a permutation of the set {Fo(N)Yi}- 

If / (z) is a modular function for Fo{N), then / (y,- z) depends only on the coset Fq {N)yi . 
Hence the functions {/(y/yz)} are a permutation of the functions {/(y,z)}, and every 
symmetric polynomial in the /(y/z) is invariant under F(l); since such a polynomial 
obviously satisfies the other conditions, it is a modular function for F(l), and hence a 
rational function of j . We have shown that /(z) satisfies a polynomial of degree fi with 
coefficients in C(7), namely, n(7 — /(y/z)). Since this holds for every / €C(Xo(N)), 
we see that C(Xo(N)) has degree at most /x over C(j ). 

Next I claim that all the fiYi^) are conjugate to /(z) overC(7): for let F(7, 7) be the 
minimum polynomial of /(z) over C (_/'); in particular, F(J,Y) is monic and irreducible 
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when regarded as a polynomial in Y with coefficients in C{j); on replacing z with y,z and 
remembering that j(Yiz) = j(z), we find that F(J(z), f (y/z)) = 0, which proves the claim. 

If we can show that the functions j(NYiz) are distinct, then it will follow that the 
minimum polynomial of j(Nz) over C(J) has degree /x; hence [C(Xo{N)) : C{j)] = /x and 
C(XoiN)) = C(j(z))[j(Nz)]. 

Suppose j{Nyiz) = jiNyi'z) for some / ^ i'. Recall that j defines an isomorphism 
F(1)\IH[* (Riemann sphere), and so, if j(NYiz) — jiNyj/z) for all z, then there exists 
aye F(l) such that Nyiz = yNyt'Z for all z, which implies that 

Hence y/y,7^ e ^(1) n r(l) = ro(A^), and this contradicts the fact 

that Yi and y,/ lie in different cosets. 

The minimum polynomial of j(Nz) over C^') is F{j,Y) = YiiY — j(Nyjz)). The 
symmetric polynomials in the j(Nyiz) are holomorphic on H. As they are rational functions 
of Jiz), they must in fact be polynomials in j(z), and so F(X, Y) e C[X, Y] (rather than 
C(X)[Y]). 

But we know (4.22) that 

00 

j{z)^q-' + J2^'rq" (12) 

with the c„ € Z. Consider j(Nyz) for some y = (^,' ^',) e F(l). Then A^yz = ^/)^' 
and j(Nyz) is unchanged when we act on the matrix on the left by an element of F(l). 
Therefore (see 5.15) 

az + b 
j(Nyz) = Ji^^) 
a 

for some integers a,b,d with ad = N . On substituting "^J^^ for z in (12) and noting that 
^2ni{az+b)ld _ ^inibld . ^iniazld ^ ^j^j ^j^^j- j(^]\fy2') has a Fourier expansion in powers 
of q^^^ whose coefficients are in and hence are algebraic integers. The same 

is then true of the symmetric polynomials in the j(Nyiz). We know that these symmetric 
polynomials lie in C[7 (z)], and I claim that in fact they are polynomials in j with coefficients 
that are algebraic integers. 

Consider a polynomial P = ^Cnj" e C[y] whose coefficients are not all algebraic 
integers. If Cm is the coefficient having the smallest subscript among those that are not 
algebraic integers, then the coefficient of q^"^ in the ^-expansion of P is not an algebraic 
integer, and so P can not be equal to a symmetric polynomial in the j(Nyiz). 

Thus F(X, Y) — ^Cm,nX"^Y'^ with the Cm,n algebraic integers (and co,/x = 1)- 

When we substitute (12) into the equation 

F(j(z)J(Nz)) = 0, 

and equate coefficients of powers of ^, we obtain a set of finear equations for the Ci„,n with 
rational coefficients. When we adjoin the equation 
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then the equations determine the Cm,n uniquely (because there is only one monic minimum 
equation for j(Nz) over C(J)). Because the system of linear equations has a solution in C, 
it also has a solution in Q; because the solution is unique, the solution in C must in fact lie 
in Q. Thus the Cm,n ^ Q> but we know that they are algebraic integers, and so they lie in Z. 

Now assume A'^ > 1. On replacing z with —l/Nz in the equation F(J(z),j(Nz)) — 0, 
we obtain 

F(j(-l/Nz)J(-l/z)) = 0, 
which, because of the invariance of j , is just the equation 

F(jiNz)Jiz)) = 0. 

This shows that F(Y,X) is a multiple of F(X,Y) (recall that F(X,Y} is irreducible in 
C(X)[Y], and hence in C[X, 7]), say, F(Y, X) = cF(X, Y). On equating coefficients, one 
sees that = \, and so c = ±1. But c = — 1 would imply that F{X,X) = 0, and so 
X — Y would be a factor of F{X, Y), which contradicts the irreducibility. Hence c = 1, and 
F{X, Y) is symmetric. 

Finally, suppose N = p,a prime. The argument following (12) shows in this case that 
the functions j{pYiz) for i ^ 1 are exactly the functions: 

i(^y m = 0,l,2,...,p-l. 

Let = e^^^'^P, and let p denote the prime ideal (1 - ^p) in Zgp]. Then p^"^ = (p). 
When we regard the functions ji^-y^) as power series in q, then we see that they are all 
congruent modulo p (meaning that their coefficients are congruent modulo p), and so 

P-i , 
F{j{z)J)'^{Y-j{pz))Y\{Y-j{^^)) 

m=0 ^ 

^(Y-j(pz))(Y-j(z/p)r (modp) 
^(Y-j(zr)(YP-j(z)) (modp). 

This implies the last equation in the theorem. □ 

Example 6.2 For N = 2, the equation is 

X^ + Y^- X^Y^ + mSXY(X + 7) - 162000(X^ + Y^) + 40773375X7 

+ 8748000000(X + 7) - 157464000000000 = 0. 

Rather a lot of effort (for over a century) has been put into computing F(X, 7) for small 
values of A'^. For a discussion of how to do it (complete with dirty tricks), see Birch's article 
in Modular Functions of One Variable, Vol I, SLN 320 (Ed. W. Kuyk). 

The modular equation F^iX, 7) = was introduced by Kronecker, and used by Kron- 
ecker and Weber in the theory of complex multiplication. For = 3, it was computed by 
Smith in 1878; for A' = 5 it was computed by Berwick in 1916; for A'^ = 7 it was computed 
by Herrmann in 1974; for A^ = 11 it was computed by MACSYMA in 1984. This last 
computation took 20 hours on a VAX-780; the result is a polynomial of degree 21 with 
coefficients up lO^'' which takes 5 pages to write out. See Kaltofen and Yui, On the modular 
equation of order 11, Proc. of the Third MACSYMA's user's Conference, 1984, pp472-485. 

Clearly one gets nowhere with brute force methods in this subject. 
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7 The Canonical Model of Xo{N) over Q 

After reviewing some algebraic geometry, we define the canonical model of Xo(N) over Q. 
Brief review of algebraic geometry 

Theorem 6.1 will allow us to define a model of Xq(N) over Q, but before explaining this 
I need to review some basic definitions from algebraic geometry. First we need a slightly 
more abstract notion of sheaf than that on pi 6. 

Definition 7.1 A presheaf F on a topological space is a map assigning to each open 
subset U of X n set F{U) and to each inclusion U D U' a. "restriction" map 

a^a\U' : F(U)^ F(U'). 

The restriction map corresponding to U D U is required to be the identity map, and if 
U D U' D U", then the restriction map F(U) F(U") is required to be the composite of 
the restriction maps F(U) F(U') and F{U') F{U"). 

If the sets F{U) are abelian groups and the restriction maps are homomorphisms, then 
F is called a presheaf of abelian groups (similarly for a sheaf of rings, modules, etc.). 

A presheaf F is & sheaf if for every open covering {Ui} ofU (ZX and family of elements 
Uj e F(Ui) agreeing on overlaps (that is , such that a ,■ | Ui riUj = aj\ Ui fl Uj for all 
there is a unique element a e F(U) such that a,- = a\Uj for all i. 

A ringed space is a pair (X, Ox) consisting of a topological space X and a sheaf of 
rings Ox on X. With the obvious notion of morphism, the ringed spaces form a category. 

Let ko be a field, and let k be an algebraic closure of ko. An affine k^-algebra ^4 is a 
finitely generated A:o-algebra A such that k (8>A:o A is reduced, i.e., has no nonzero nilpotent 
elements. This is stronger than saying that A itself is reduced: for example, 

A = ko[X,Y]/{XP + YP +a) 

is an integral domain when p = char(/co) and a ^ k^, because X^ + + a is then 
irreducible, but obviously 

k^koA = k[X, Y]/iXP + YP+a) 
^k[X.Y]/({X + Y +apy) 

is not reduced. This problem arises only because of inseparability — if ko is perfect, then 
every reduced finitely generated /:o-algebra is an affine /co-algebra. 
Let ^ be a finitely generated /co-algebra. We can write 

A = ko[xi,...,Xn] = ko[Xi,...,X„]/(fi,...,fm), 

and then 

k^koA = k[Xi,...,X„]/(f,,...,fm). 

Thus A is an affine algebra if and only if the elements /i , fm of ko[Xi , Xn] generate 
a radical ideal' in k[Xi, X„]. 

'An ideal o is radical if 

f" w > 1 / go. 
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Let A be an affine /co-algebra. Define specm(/l) to be the set of maximal ideals in A, 
and endow it with the topology for which the sets 

^(/) = {m|/^m}, feA, 

form a base for the open sets. There is a unique sheaf of /jQ-algebras O on specm(/l) such 
that OiD(f)) = Af = A[f-^] for all /. Here O is a sheaf in the abstract sense — the 
elements of 0(J7) are not functions on U with values in ko, although we may wish to think 
of them as if they were. For f ^ A and e specm^i, we define / (v) to be the image of 

def 

/ in the k(v) = A /my. Then v ^-^ f(v) is not a function on specm(^) in the conventional 
sense because, unless ko = k, the fields k{v) vary with v, but it does make sense to speak of 
the set V{f) of zeros of f in X, and this zero set is the complement of !)(/). 
The ringed space 

def 

Specm(^) = (specm(^),C'), 

as well as every ringed space isomorphic to such a space, is called an affine variety over kg. 
A ringed space (X, Ox) is a prevariety over ko if there exists a finite covering (Ui) of X by 
open subsets such that (Ui , Ox \ Uj) is an affine variety over ko for all / . 

A morphism, or regular map, of prevarieties over ko is a morphism of ringed spaces. 
For example, when ko — k, we can regard Ox as a sheaf of k-valued functions on X, and a 
regular map of prevarieties (X, Ox) (Y, Oy) is a continuous map (p:X such that 

f sOy{U),U op&nmY ^ f o(p s Ox{(p~^U). 

When ko ^ k, it is necessary to specify also the map on sheaves. A prevariety X over k is 
separated if for all pairs of regular maps Z — ^ X, the set where the maps agree is closed in 
Z. A variety is a separated prevariety. 

When V = Specm^ and W = SpecmS, there is a one-to-one correspondence between 
the regular maps W V and the homomorphisms of /co-algebras A ^ B. For example, if 

A = ko[Xi,...,Xm]/a = k[xi,...,Xm] 
B = ko[Y,,...,Y„]/b = k[y,,...,yn], 

then a homomorphism ^4 ^ S is determined by a family of polynomials 

(Pi)i = i,...,m, Pieko[Yi,...,Y„], 

(representatives for the images of xi , . . . , x^)- The corresponding regular map W —^V sends 
{b\,..., b„) e W(k) to (. . . , Pi (bi ,...,bn), . . .) e V(k). In order to define a homomorphism, 
the Pi must be such that 

F6o^F(Pi,...,P^)€b; (13) 

in particular, if (bi,..., b„) e W(k), then (13) implies that F(. . . , Pi (bi , ...,bn),- . .) = 0, 
and so W{k) does map into V(k). Two families Pi,..., Pm and Qi, Qm determine the 
same map if and only if P, = Qi mod b for all /. 

There is a canonical way of attaching a variety X over /: to a variety Xo over ko; for 
example, if Xo = Specm(^), then X = Specm(/c ^j^^A). We then call Xo a model for 
X over ko. When X C A", to give a model for X over ko is the same as giving an ideal 
ao C ko[Xi ,...,Xn\ such that Oq generates the ideal of X, 

I{X) = {f €k[Xi,...,Xn]\f = QonX}. 
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Of course, X need not have a model over ko — for example, an elhptic curve E over k will 
have a model over ko C k if and only if its 7 -invariant j(E) lies in ko. Moreover, when 

X has a model over ko, it will usually have a large number of them, no two of which are 
isomorphic over ko- For example, let X be a nondegenerate quadric surface in over k. 
Such a surface is isomorphic to the surface 

X^ + Y^ + Z^ + W^ = 0. 

The models of X over ko are defined by equations 

aX^ + bY^ + cZ^ + dW^ ^0, a,b,c,deko, abcd^O. 

Thus classifying the models of X over ko amounts to classifying the equivalence classes of 
quadratic forms over ko in 4 variables. If ko = Q, there are infinitely many. 

Let X be a variety over ko- A point of X with coordinates in ko, or a ko-point of X, is a 
morphism Specm^o ^ ^- For example, if Z is affine, say X = Specm^, then a point of X 
with coordinates in ko is a /co-homomorphism ^ ^ ko- If ^ = k[Xi, (fi, fm), 
then to give a ^o-homomorphism ^ ^ ^0 is the same as giving an «-tuple {ai,...,a„) such 
that 

fi(ai,...,an) = i = \,...,m. 

In other words, a point of X with coordinates in ko is exactly what you expect it to be. 
Similar remarks apply to projective varieties. We write X{ko) for the points of X with 

coordinates in ko- 

Similarly, it is possible to define the notion of a point of X with coordinates in any ko- 
algebra R, and we write X{K) for the set of such points. For example, when X = Specm^, 

X{R) = Homfc.aigebra(^, 

When k = ko, X(ko) ~ X- How is X(ko) related to X when k 7^ A:o? Let v e X- Then v 
corresponds to a maximal ideal mv (actually, it is a maximal ideal), and we write k(v) for 
the residue field Ov/mv- This is a finite extension of ko, and we call the degree of k(v) over 
ko the degree of v. The set X(ko) can be identified with the set of points t; of Z of degree 1. 
(Suppose for example that X is affine, say X — Specm^. Then a point d of Z is a maximal 
ideal in A- Obviously, rrii, is the kernel of a /ro-homomorphism A ^ koif and only if 

def 

ic{v) = A/m-i; = ko, in which case it is the kernel of exactly one such homomorphism.) 

The set X(k) can be identified with the set of points on X;^, where is the variety over 
k defined by X. When ko is perfect, there is an action of Gal(k/ko) on X(k), and one can 
show that there is a natural one-to-one correspondence between the orbits of the action and 
the points of X- (Again suppose X = Specm^, and letveX; then v corresponds to the 
set of /cQ-homomorphisms A^ k with kernel m^, which is a single orbit for the actiion of 
Galik/ko).) 

Assume ko is perfect. As we just noted, if Xo is a variety over ko, then there is 

def 

an action of Gal(k/ko) on Xo(k)- The variety X = (Xo)j^ together with the action of 
Gal{k/ko) on X(k) determine Xo- For example, if Xo — Specm^o, so Z = Specm^l with 
A = k (^ko then the action of Gal(A:/A:o) on X(k) determines an action of Gal(k/ko) 
on ^ and Ao = ^Gai(fc/A;o). 

For more details on this material, see my notes on Algebraic Geometry (AG), especially 
Chapters 2, 3, and 11. 
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Curves and Riemann surfaces 

Fix a field ko, and let X be a connected algebraic variety over ko. The function field ko{X) 
of X is the field of fractions of Ox{U) for any open affine subset U of X; for example, if 
X = Specm^, then koiX) is the field of fractions of A. The dimension of X is defined to 
be the transcendence degree of ko(X) over ko. An algebraic curve is an algebraic variety 
of dimension 1. 

To each point u of X there is attached a local ring Oy. For example, if X = Specm^, 
then a point v of X is a maximal ideal m in A, and the local ring attached to v is Am- An 
algebraic variety is said to be regular if all the local rings Am are regular ("regular" is a 
weaker condition than "nonsingular"; nonsingular implies regular, and the two are equivalent 
when the ground field ko is algebraically closed). 

Consider an algebraic curve X. Then X is regular if and only if the local rings attached 
to it are discrete valuation rings. For example, Specm^ is a regular curve if and only if A is 
a Dedekind domain. A regular curve X defines a set of discrete valuation rings in ko(X), 
each of which contains ko, and X is complete if and only if this set includes all the discrete 
valuation rings in ^o(^) having koiX) as field of fractions and containing ko- 

A field K containing ko is said to be a function field in n variables over ko if it is 
finitely generated and has transcendence degree n over ko- The field of constants of K is 
the algebraic closure of ko in K- Thus the function field of an algebraic variety over ko of 
dimension n is a function field in n variables over ko (whence the terminology). 

Theorem 7.2 The map X v> ko(X) defines an equivalence from the category of complete 
regular irreducible algebraic curves over ko to the category of function fields in one variable 
over ko; the curve X is geometrically irreducible^ if and only ifko is the field of constants 
ofko(X). 

Proof. The curve corresponding to the field K can be constructed as follows: take X to be 
the set of discrete valuation rings in K containing ko and having K as their field of fractions; 
define a subset J7 of X to be open if it omits only finitely many elements of X; for such a U, 
define Ox(U) to be the intersection of the discrete valuation rings in U- □ 

Corollary 7.3 A regular curve U can be embedded into a complete regular curve U ; the 
map U U is universal among maps from U into complete regular curves. 

Proof. Take U to be the complete regular algebraic curve attached to ko(U). There is an 
obvious identification of U with an open subset of U . □ 

Example 7.4 Let F(X, Y) be an absolutely irreducible polynomial in ko[X, Y], and let 

def 

A = ko[X, Y]/ {F(X, Y)). Thus A is an affine /:o-algebra, and C = Specm^ is the curve 
F(X, Y) = 0. Let C"^ be the complement in C of the set of maximal ideals of A containing 
the ideal (df/ dX, dF / dY) mod F{X, Y). Then C^^ is a nonsingular curve, and hence can 
be embedded into a complete regular curve C . 

There is a geometric way of constructing C , at least in the case that ko = k is alge- 
braically closed. First consider the plane projective curve C' defined by the homogeneous 
equation 

2deg(F)^^j^/2,7/Z) = 0. 
^A variety X over a field ko is geometrically irreducible if Xj^ is irreducible [k an algebraic closure of ko). 
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This is a projective (hence complete) algebraic curve which, in general, will have singular 
points. It is possible to resolve these singularities geometrically, and so obtain a nonsingular 
projective curve (see W. Fulton 1969, pl79). 

Theorem 7.5 Every compact Riemann surface X has a unique structure of a complete 
nonsingular algebraic curve. 

Proof. We explain only how to construct the associated algebraic curve. The underlying set 
is the same; the topology is that for which the open sets are those with finite complements; the 
regular functions on an open set U are the holomorphic functions on U that are meromorphic 
on the whole of X. □ 

Remark 7.6 Theorems 7.2 and 7.5 depend crucially on the hypothesis that the variety has 
dimension 1. 

In general, many different complete nonsingular algebraic varieties can have the same 
function field. A nonsingular variety U over a field of characteristic zero can be embedded 
in a complete nonsingular variety U, but this is a very difficult theorem (proved by Hironaka 
in 1964), and U is very definitely not unique. For a variety of dimension > 3 over a field of 
characteristic p > 0, even the existence of U is not known. 

For a curve, "complete" is equivalent to "projective"; for smooth surfaces they are 
also equivalent, but in higher dimensions there are many complete nonprojective varieties 
(although Chow's lemma says that a complete variety is not too far away from a projective 
variety). 

Many compact complex manifolds of dimension > 1 have no algebraic structure. 
The curve Xo{N) over Q 

According to Theorem 7.5, there is a unique structure of a complete nonsingular curve on 
Xo(N) compatible with its structure as a Riemann surface. We write Xo(N)c for Xo(N) 
regarded as an algebraic curve over C. Note that Xo(N)c is the unique complete nonsingular 
curve over C having the field C(J(z),j(Nz)) of modular functions for FoiN) as its field of 
rational functions. 

Now write Fn{X, Y) for the polynomial constructed in Theorem 6.1, and let C be the 
curve over Q defined by the equation: 

Fn{X,Y) = 0. 

As is explained above, we can remove the singular points of C to obtain a nonsingular curve 
(jns Q^gj. ^jj^ jj^gj^ g^j^ embed C"^ into a complete regular curve C. The coordinate 
functions x and y are rational functions on C , they generate the field of rational functions 
on C, and they satisfy the relation F^(x,}') = 0; these statements characterize C and the 
pair of functions x,y on it. 

Let Cc be the curve defined by C over C. It can also be obtained in the same way as C 
starting from the curve Fn{X, Y) = 0, now thought of as a curve over C. There is a unique 
isomorphism Cc Xo(N)£ making the rational functions x and y on Cc correspond to 
the functions j(z) and j(Nz) on Xq(N). We can use this isomorphism to identify the two 
curves, and so we can regard C as being a model of Xo{N)c over Q. We denote it by 
Xo(N)q. (In fact, we often omit the subscripts from Xo(N)ic and Xo(N)q.) 
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We can be a little more explicit: on an open subset, the isomorphism Xo{N) Cc 
is simply the map [z] (j(z),j(Nz)) (regarding this pair as a point on the affine curve 

Fn(X,Y) = 0). 

The action of Aut(C) on Xo(N) corresponding to the model Xo(N)q has the following 
description: for r e Aut(C), r[z] = [z'] if Tj{z) — j{z') and rj{Nz) — j{Nz'). 

The curve Xq{N)(q is called the canonical model of Xq{N) over Q. The canonical 
model X(1)q) of X{\) is just the projective line over Q. If the field of rational functions on 

is Q(r), then the identification of P^ with X(l) is made in such a way that T corresponds 
toy. 

The quotientmap Xq{N) X{\) corresponds to the map of algebraic curves Xq{N)(q 
X(1)q defined by the inclusion of function fields Q(T') Q(x,y), T x. On an open 
subset of Xo(N)(Q, it is the projection map {a,b) a. 
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8 Modular Curves as Moduli Varieties 

Algebraic geometers and analysists worked with "moduli varietes" that classify isomorphism 
classes of certain objects for a hundred years before Mumford gave a precise definition of a 
moduli variety in the 1960s. In this section I explain the general notion of a moduli variety, 
and then I explain how to realize the modular curves as moduli varieties for elliptic curves 
with additional structure. 

The general notion of a moduli variety 

Fix a field k which initially we assume to be algebraically closed. A moduli problem over k 
is a contravariant functor F from the category of algebraic varieties over k to the category of 
sets. In particular, for each variety V over k we are given a set T{V), and for each regular 
map (p:W V, we are given a map J^((p):J^(V) J^iW). Typically, J^iV) will be the set 
of isomorphism classes of certain objects over V. 

A solution to the moduli problem is a variety V over k together with an identification 
V(k) = T(k) and certain additional data sufficient to determine V uniquely. More precisely: 

Definition 8.1 A pair (V,a) consisting of a variety V over k together with a bijection 
a:T(k) V(k) is called a solution to the moduli problem T if it satisfies the following 

conditions: 

(a) Let r be a variety over k and let / € J^{T); a point t € T{k) can be regarded as a 
map SpecmA: — ^ T, and so (by the functoriality of F) f defines an element ft of 
F{k); we therefore have a map t oc(ft): T(k) V(k), and this map is required to 
be regular (i.e., defined by a morphism of algebraic varieties), 

T(k) V(k) f€F{T) 

t ^ ft=Ht){f)^Hk). 

(b) (Universality) Let Z be a variety over k and let ^ : T{k) Z(k) be a map such that, 
for every pair (T, f) as in (a), the map t Pift)'- T(k) Z(k) is regular; then the 
map ^oa~^: V{k) Z(k) is regular, 

Tik) V(k) 
Z(k). 

A variety V that occurs as the solution of a moduli problem is called a moduli variety. 

Proposition 8.2 Up to a unique isomorphism, there exists at most one solution to a 
moduli problem. 

Proof. Suppose there are two solutions (V,a) and (V',a'). Then because of the universal- 
ity of (V, a), a' oa~^:V ^ V is a regular map, and because of the universality of (V',a'), 
its inverse is also a regular map. □ 

Of course, in general there may exist no solution to a moduli problem, and when there 
does exist a solution, it may be very difficult to prove it. Mumford was given the Fields 
medal mainly because of his construction of the moduli varieties of curves and abelian 
varieties. 
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Remark 8.3 (a) For a variety V over k, let hy denote the functor T Hom(r, V). The 
condition (a) in (8.1) says that a extends to a natural transformation F ^ hy, which (b) 
says is universal. 

(b) It is possible to modify the above definition for the case that the ground field 
kn is not algebraically closed. For simplicity, we assume /cq to be perfect, and we fix 
an algebraic closure /c of /cq- Now K is a variety over k^ and a is a family of maps 
a{k'):F{k') —>■ V{k') (one for each algebraic extension k' of kn) compatible with inclusions 
of fields, and {Vk,a{k)) is required to be a solution to the moduli problem over k. If {V, a) 
and {V ,a') are two solutions to the same moduli problem, then a' oa~^: V{k) —>■ V'{k) 
and its inverse are both regular maps commuting with the action of G&\{k/ ko); they are both 
therefore defined over /co- Consequently, up to a unique isomorphism, there again can be at 
most one solution to a moduli problem. 

Note that we don't require a{k') to be a bijection when k' is not algebraically closed. 
In particular, V need not represent the functor F. When V does represent the functor, V is 
called a fine moduli variety; otherwise it is a coarse moduli variety. 

The moduli variety for elliptic curves 

We show that is the moduli variety for elliptic curves over a perfect field /cq- 
An elliptic curve E over a field k' is a curve given by an equation of the form, 

Y^Z + aiXYZ + a-iYZ^ = + a2X'^ Z + a^XZ^ + aeZ^ (14) 

for which the discriminant A{a\,a2,a?,,a^,a(,) ^ 0. It has a distinguished point (0: 1:0), 
and an isomorphism of elliptic curves over k' is an isomorphism of varieties carrying the 
distinguished point on one curve to the distinguished point on the second. (There is a unique 
group law on E having the distinguished element as zero, and a morphism of elliptic curves 
is automatically a homomorphism of groups.) 

Let F be a variety over a field k' . An elliptic curve (better, family of elliptic curves) 
over K is a map of algebraic varieties E ^ V where E is the subvariety of K x P-^ defined 
by an equation of the form (14) with the at regular functions on V; A{ai,a2,cfi,ciA,ci^) is 
now a regular function on V which is required to have no zeros. 

For a variety V , let 8{V) denote the set of isomorphism classes of elliptic curves over V . 
Then f is a contravariant functor, and so can be regarded as a moduli problem over k^. 

For any field k' containing ko, the j -invariant defines a map 

E ^ j{E):£(k') {k') = k', 

and the theory of elliptic curves (Milne 2006, II 2.1) shows that this map is an isomorphism 
if k' is algebraically closed (but not in general otherwise). 

Theorem 8.4 The pair (A^ , j) is a solution to the moduU problem £. 

Proof. For any /:o-homomorphism o:k' k", jioE) = aj(E), and so it remains to show 
that (A \ 7 ) satisfies the conditions (a) and (b) over k. 

Let E ^ T be a. family of elliptic curves over T, where T is a variety over k. The map 
t i-> j(Et):T(k) A^ (k) is regular because j(Et) = c^/ A where C4 is a polynomial in 
the Ui 's and A is a nowhere zero polynomial in the a, 's. 
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Now let (Z,j6) be a pair as in (b). We have to show that j ^(Ej):A^(k) Z(k), 
where Ej is an elMptic curve over k with y -invariant 7, is regular. Let U be the open subset 
of obtained by removing the points and 1728. Then 

E:Y^Z + XYZ ^ — — XZ^ ^- Z^ ueU, 

M-1728 M-1728 

is an elliptic curve over U with the property that j{Eu) = u (Milne 2006, 11 2.3). Because 
of the property possessed by (Z, P), E/U defines a regular map u fi{Eu)'- U Z. But 
this is just the restriction of the map j ^(Ej) to U(k), which is therefore regular, and it 
follows that j itself is regular. □ 



The curve Yo(N)q as a moduli variety 

Let ^ be a perfect field, and let N hea positive integer not divisible by the characteristic 
of k (so there is no restriction on when k has characteristic zero). Let E be an elliptic 
curve over k. When k is an algebraically closed field, a cyclic subgroup of E of order N is 
simply a cyclic subgroup of E(k) of order A'^ in the sense of abstract groups. When k is not 
algebraically closed, a cyclic subgroup of £■ is a Zariski-closed subset S such that S(k^^) 
is cyclic subgroup of £(/r^') of order A'^. Thus S(k'^^) is a cyclic subgroup of order of 
E(k''^^) that is stable (as a set — not elementwise) under the action of Gal(fc^V k), and every 
such group arises from a (unique) S. 

An isomorphism from one pair (E, S) to a second (E', S') is an isomorphism E ^ E' 
mapping S onto 5". 

These definitions can be extended in a natural way to families of elhptic curves over 
varieties. 

For a variety V over k, define £^o,iv(^) to be the set of isomorphism classes of pairs 
{E,S) where E is an elliptic curve over V , and 5 is a cyclic subgroup of E of order A^. 
Then <5o,iV is a contravariant functor, and hence is a moduli problem. 

Recall that A{(j)i,co2) is the lattice generated by a pair {0)1,(02) with '^{0)1/0)2) > 0. 
Note that A{ci)\, 0)2) / A{(Oi, 0)2) is a cyclic subgroup of order A'^ of the elliptic curve 
<C/A{o)i,a)2). 

Lemma 8.5 The map 

H So,n{C), z ^ iC/A(z, 1), A(z,N-'^)/A{z, 1)) 
induces a bijection ro(N)\M 6o,n(C). 

Proof. Easy— see Milne 2006, V 2.7. □ 

Let 6q ^(k) denote the set of isomorphism classes of homomorphisms of elliptic curves 
a:E ^ E' over k whose kernel is a cyclic subgroup of E of order N. The map 

a (E,KeT{a)y.£Q j^{k) £o,N(k) 

is a bijection; its inverse is (E, S) (E ^ E/S). For example, the element 

{C/A{z,\),A{z,N-^)/A{z,l)) 

of £:o,iv(C) corresponds to the element (C/A{z, 1) ^ C/A{Nz, 1)) of £"^^^(C). 
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Let Fj<^{X, F) be the polynomial defined in Theorem 6.1 and let C be the (singular) 
curve Fn(X, Y) = over Q. For any field k D Q, consider the map 

S'^^^ik) ^ A^k), {E,E') ^ UiE)J(E')). 

When k = C, the above discussion shows that the image of this map is contained in C(C), 
and this impUes that the same is true for all k. 

Recall that Yo(N) = roiN)\E.. There is an affine curve Yo{N)q C Xo(A^)q which is a 
model of Yo(N) C XoiN). (This just says that the set of cusps on Xo(N) is defined over 

Q.) 

Theorem 8 . 6 Let k be a field, and let N be an integer not divisible by the characteristic of 
k. The moduli problem Sq j^ has a solution (M,a) over k. When k = Q, M is canonically 
isomorphic to Yo(N)q, and the map 

£o,N(k) ^ M(k) ^ Yo(N)Q(k) C(k) 
is (E,S)^U(E)J(E/S)). 

Proof. When k = Q,it is possible to prove that Yo(N)q is a solution to the moduli problem 
in much the same way as for above. If p \ N , then it is possible to show that Yq{N)q 
has good reduction at p, and the curve 1o(//)f^ over it reduces to is a solution to the 
moduli problem over Fjp. □ 

The curve Y{N) as a moduli variety 

Let A' be a positive integer, and let ^ e C be a primitive A'^th root of 1. A level- N structure 
on an elliptic curve £" is a pair of points t = {ti, t2) in E{k) such that the map 

(m,m')h^ imti,mt2):Z/NZxZ/NZ^ E(k) 

is injective. This means that E(k)N has order A'^-^, and ti and ?2 form a basis for E(k)N 
as a Z/ «Z-module. For any variety V over a field k D Q[^], define S;^{V) to be the set of 
isomorphism classes of pairs (E,t) where E is an elliptic curve over V and t = (ti, Z2) is a 
level- A'^ structure on E such that CNitiJi) = t (here is the Weil pairing — see Silverman 
III.8). Then Si^i is a contravariant functor, and hence is a moduli problem. 

Lemma 8.7 The map 

U-^£n{C), zh^ (C/A(z,1),(z/A^,1/A^) modyl(z,l)) 

induces a bijection r(N)\M £^(C). 

Proof. Easy. □ 

Theorem 8 . 8 Let k be a field containing Q[^] , where t, is a primitive N th root of 1 . The 
moduli problem has a solution (M, a ) over k . When k = C, M is canonically isomorphic 
to Y{N)c (= X(N)c with the cusps removed). Let M be the solution to the moduli problem 
£jq over Q[^]; then M has good reduction at the prime ideals not dividing N . 

Proof. Omit. □ 
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Example 8.9 For N = 2, the solution to the moduU problem is A . In this case, there is a 
universal elliptic curve with level-2 structure over A\ namely, the curve 

E:Y^Z = X{X-Z){X-XZ). 

Here A is the coordinate on A\ and the map E is (x : y : z,X) X. The level-2 

structure is the pair of points (0:0:1), (1:0:1). The curve E is universal in the following 
sense: for every family of elliptic curves E' ^ V with level-2 structure over a variety 
V (with the same base field k), there is a unique morphism V ^ such that E' is the 
pull-back of E. In this case the map S(k) A^ (k) is an isomorphism for all fields k D Q, 
and A^ is a fine moduli variety. 
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9 Modular Forms, Dirichlet Series, and Functional Equations 

The most famous Dirichlet series, i^{s) Yl'n^=i'^~^ ^ shown by Riemann (in 1859) to 
have an analytic continuation to the whole complex plane except for a simple pole at 5 = 1 , 
and to satisfy a functional equation 

Z(s) = Z(l-s) 

where Z{s) = n~^l^ r{s /l)t.{s). The Hasse-Weil conjecture states that all Dirichlet series 
arising as the zeta functions of algebraic varieties over number fields should have mero- 
morphic continuations to the whole complex plane and satisfy functional equations. In this 
section we investigate the relation between Dirichlet series with functional equations and 
modular forms. 

We saw in (2.12) that the modular group r{\) is generated by the matrices T — {^\) 
and S = (_j q). Therefore a modular function /(z) of weight 2k satisfies the following 
two conditions: 

/(Z + 1) = /(Z), /(-1/Z) = (-Z)2V(Z). 

The first condition implies that / (z) has a Fourier expansion /(z) = ^ , and so defines 
a Dirichlet series (p{s) = '^Unn^^ . Hecke showed that the second condition implies that 
the Dirichlet series satisfies a functional equation, and conversely every Dirichlet series 
satisfying a functional equation of the correct form (and certain holomorphicity conditions) 
arises from a modular form. Weil extended this result to the subgroup ro(N) of r(l), which 
needs more than two generators (and so we need more than one functional equation for the 
Dirichlet series). In this section we explain Hecke's and Weil's results, and in later sections 
we explain the implications of Weil's results for elliptic curves over Q. 

The Mellin transform 

Let ai,a2>--- be a sequence of complex numbers such that a„ = 0(n^) for some M. 
This can be regarded as the sequence of coefficients of either the power series f{q)=- 
Y^'^ dnl", which is absolutely convergent for \q\ < 1 at least, or for the Dirichlet series 
V(^) = X^i"*^"""^' which is absolutely convergent for dl{s) > M + 1 at least. In this 
subsection, we give explicit formulae that reahze the formal correspondence between / (y) 
and (p(s). 

Recall that the gamma function r(s) is defined by the formula, 

/■oo 

Jo 

It has the following properties: r(s + 1) = sr(s), r(l) = 1, and r{^) = .Jji\ r(s) extends 
to a function that is holomorphic on the whole complex plane, except for simple poles at 
s = —n, where it has a residue ^ ^/ , w = 0, 1,2, . . .. 

Proposition 9.1 (Mellin Inversion Formula) For every real c > 0, 
e""" = — / r(s)x~'ds, X > 0. 

-^^l Jc—ioo 

(The integral is taken upwards on a vertical line.) 
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Proof. Regard the integral as taking place on a vertical circumference on the Riemann 
sphere. The calculus of residues shows that the integral is equal to 



oo oo ^ 



2jti ^2 reSi=-«-^ ^ ^(s) = 27Ti - — |^ = Ini -e 



n=0 n=0 

Alternatively, note that the Mellin inversion formula is just the Fourier inversion formula for 
the group M-|- with invariant measure dx/x. □ 

Theorem 9.2 Let a^, fl2> ■ • • be a sequence of complex numbers such that a„ = 0(n^) 
for some M. Write f(x) = Y.T ^ne""'' and(j)(s) = J^T ^nn~' . Then 

/■oo 

r(5)(/)(5)= / f(x)x'-Ux for9i(5)>max(0,M + l), (*) 
Jo 

f(x) = :/ ^(s)r(s)x~'ds forc>max(0,M + 1) and'in{s)>0. (**) 



lOO 



Proof. First consider (*). Formally we have 



/■oo /■oo °° 

/ f(x)x'-Ux^ / Yane-""" x'-Ux 
Jo Jo J 

°° /■oo 

= / a„e~"^x^~^dx 

oo 

1 

on writing x for nx in the last integral and using the definition of r(s). The only problem is 
in justifying the interchange of the integral with the summation sign. 

The equation (**) follows from Proposition 9.1. □ 

The functions / (x) and (p(s) are called the Mellin transforms of each other. 

The equation (*) provides a means of analytically continuing (p{s) provided f(x) tends 
to zero sufficiently rapidly at x = 0. In particular, if / (x) = 0(x^) for every ^4 > as x ^ 
through real positive values, then r(s)<p(s) can be extended to a holomorphic function over 
the entire complex plane. Of course, this condition on / (x) implies that x = is an essential 
singularity. 

We say that a function (p(s) on the complex plane is bounded on vertical strips, if for 
all real numbers a < b, (p(s) is bounded on the strip a <dl(s) < b as ^s(s) ±oo. 

Theorem 9.3 (Hecke 1936) Let aQ,a\,a2, . . . he a sequence of complex numbers such 
that an = 0{n^) for some M. Given A>0, ^>0, C = ±l, write 

(a) (p{s) = J2^nn^^', ((p(s) converges for 'iR(s) > M + I) 

(b) 0{s) = {^y'ris)(p{s); 

(c) f(z) = fl„e27r/«z/A . (converges for S(z) > O;. 
Then the following conditions are equivalent: 
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(i) The function <P{s) + ^ + can be analytically continued to a holomorphic function 
on the entire complex plane which is bounded on vertical stirips, and it satisfies tiie 
functional equation 

(p{k-s) = C<P{s). 

(ii) In the upper half plane, f satisfies the functional equation 

f(-l/z) = C(z/iffiz). 
Proof. Given (ii), apply (*) to obtain (i); given (i), apply (**) to obtain (ii). □ 

Remark 9.4 Let r'(X) be the subgroup of F(l) generated by the maps z z + X and 
z — 1/z. A modular form of weight k and multiplier C for r'(X) is a holomorphic 
function / (z) on H such that 

/(z + A) = /(z), /(-1/z) = dz/iffiz), 

and / is holomorphic at /oo. This is a slightly more general notion than in Section 4 — ^if k 
is an even integer and C = 1 then it agrees with it. 

The theorem says that there is a one-to-one correspondence between modular forms of 
weight k and multiplier C for r'(X) whose Fourier coefficients satisfy a„ = 0(n^) for 
some M, and Dirichlet series satisfying (i). Note that <P(s) is holomorphic if / is a cusp 
form. 

For example ^(s) corresponds to a modular form of weight 1/2 and multiplier 1 for 

r'(2). 



Weil's theorem 

Given a sequence of complex numbers ai,a2,-- - such that a„ = 0(n^) for some M, write 

00 oo 

L(5) = ^«„n-^ A(s) = (2n)-'r(s)L(s), /(z) = ^fl„e2'^'«^ (15) 

n=l n=l 

More generally, let m > be an integer, and let ;f be a primitive character on (Z/mZ)^ 
(primitive means that it is not a character on (Z/dZ)^ for any proper divisor dofm). As 
usual, we extend x(s) to the whole of Z/ mZ by setting /(«) = if n is not relatively prime 
to m. We write 

oo oo 

L;,(5) = ^fl„/(«)«-^ A^{s) = i^rris)L^is), fx{z) = J2anX(n)e^''""- 

n=l n=l 

Note that and f), are the Mellin transforms of each other. 
For any x, the associated Gauss sum is 



n=l 



Obviously xia)g(x) = E/(«)e~^''''"'/'", and hence 



9. Modular Forms, Dirichlet Series, and Functional Equations 



107 



It follows from this last equation that 

m 

A = '^"Va)En«)/u(om)- 
1 

Theorem 9.5 Let /(z) be a modular form of weight 2k for ro(N), and suppose that 
f\k{^ -1) = C{-lff for some C = ±1. Dehne 

Cx = Cg{x)xi-N)/g(j). 
Then Ay^{s) satisfies the functional equation: 

Ajf^(s) = Cy_N^~^ A^{2k-s) whenever gcd(m,N) = 1. 
Proof. Apply Theorem 9.3. □ 

The most interesting result is the converse to this theorem. 

Theorem 9.6 (Weil 1967) Fix a C = ±1, and suppose that for all but finitely many 
primes p not dividing N the following condition holds: for every primitive character x 
of (Z/ pl')^, the functions A(s) and A^(s) can be analytically continued to holomorphic 
functions in the entire complex plane and that each of them is bounded on vertical strips; 
suppose also that they satisfy the functional equations: 

A(s) = CN^-'A{2k-s) 

A^{s) = CxN^-'A^(2k-s) 

where is defined above; suppose further that the Dirichlet series L(s) is absolutely 
convergent for s — k — e for some e > 0. Then f (z) is a cusp form of weight 2k for rQ{N). 

Proof. Several pages of manipulation of 2 x 2 matrices. □ 

Let E be an elliptic curve over Q, and let L(E,s) be the associated L-series. As we 
shall see shortly, it is generally conjectured that L(s) satisfies the hypotheses of the theorem, 
and hence is attached to a modular form / (z) of weight 2 for FoiN). Granted this, one can 
show that there is nonconstant map a: Xq{N) E (defined over Q) such that the pull-back 
of the canonical differential on E is the differential on X{){N) attached to /(z). 

Remark 9.7 Complete proofs of the statements in this section can be found in Ogg 1969, 
especially Chapter V. They are not particularly difficult — it would only add about 5 pages to 
the notes to include them. 
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10 Correspondences on Curves; the Theorem of 

Eichler-Shimura 

In this section we sketch a proof of the key theorem of Eichler and Shimura relating the 
Hecke correspondence Tp to the Frobenius map. In the next section we explain how this 
enables us to realize certain zeta functions as the MelUn transforms of modular forms. 



The ring of correspondences of a curve 

Let X and X' be projective nonsingular curves over a field k which, for simplicity, we take 
to be algebraically closed. 

A correspondence T between X and X' is a pair of finite surjective morphisms 

X^Y^X'. 

It can be thought of as a many- valued map X ^ X' sending a point P € X{k) to the set 
{^{Qi)} where the Qi run through the elements of (P) (the Qi need not be distinct). 
Better, define Div(X) to be the free abelian group on the set of points of X; thus an element 
of Div(X) is a finite formal sum 

D = ^npP, npeZ, PeC. 

A correspondence T then defines a map 

Div(X) ^ Div(ZO, P ^ ^/6(eO, 

(notations as above). This map multiplies the degree of a divisor by deg(a). It therefore 

sends the divisors of degree zero on X into the divisors of degree zero on X', and one 
can show that it sends principal divisors to principal divisors. It therefore defines a map 
T:J(X)^ J(X') where 

J(X) = Div°(Z)/{ principal divisors}. 

We define the ring of correspondences A(X) on X to be the subring of End(J(X)) gener- 
ated by the maps defined by correspondences. 
If T is the correspondence 




then the transpose T' of T is the correspondence 

X'^Y^X. 

A morphism a:X ^ X' can be thought of as a correspondence 

x^r ^X' 

where F C X x X' is the graph of a and the maps are the projections. 



Aside 10.1 Attached to any complete nonsingular curve X there is an abelian variety Jac(X) 
whose set of points is J{X). The ring of correspondences is the endomorphism ring of Jac{X) — see 
the next section. 
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The Hecke correspondence 

Let r be a subgroup of F(l) of finite index, and let a be a matrix with integer coefficients 
and determinant > 0. Write FaF = [J Fat (disjoint union). Then we get a map 

T{a): J(X(F)) ^ J(X(F)), [z] ^ ^Kz]. 

As was explained in Section 5, this is the map defined by the correspondence: 

X(F) ^ X(Fc) ^ X(F) 

where Fa = F Da^^ Fa. In this way, we get a homomorphism T-L ^ A from the ring of 
Hecke operators into the ring of correspondences. 

Consider the case F = Fo{N) and T = T(p) the Hecke correspondence defined by the 
double coset Fo(N) (g p) Fo(N). Assume that p \ N . We give two further descriptions of 
T(p). 

First, identify a point of Yo(N) (over C) with an isomorphism class of homomorphisms 
E ^ E' of elliptic curves with kernel a cyclic group of order A'^. The subgroup Ep of E of 
points of order dividing p is isomorphic to (Z/ />Z) x (Z/ /)Z). Hence there are /) + 1 cyclic 
subgroups of Ep of order p, say So,Si,...,Sp (they correspond to the lines through the 
origin in F^). Then (as a many-valued map), T{p) sends a:E E' to 

{Ei^E[\i=Q,l,...,p} 

where £",■ = E / Si and E'- = E'/a(Si). 

Second, regard Yo(N) as the curve C defined by the polynomial Fn{X, Y) constructed 
in Theorem 6.1 (of course, this isn't quite correct — there is a map Yo(N) — ^ C, [z] 
(J(z), j(Nz)), which is an isomorphism over the nonsingular part of C). Let (J,j') be 
a point on C ; then there are elliptic curves E and E' (well-defined up to isomorphism) 
such that j = j(E) and j' = j{E'). The condition Fj<f{j,j') = implies that there is a 
homomorphism a:E ^ E' with kernel a cyclic subgroup of order A^. Then T{p) maps 
0\ /) to {{ji , i/) \i=Q,...,p} where y,- = j{E/Si) and ;/ = j{E' /a{Si)). 

These last two descriptions of the action of T(p) are valid over any field of characteristic 

0. 

The Frobenius map 

Let C be a curve defined over a field k of characteristic p ^ 0. Assume (for simplicity) that 
k is algebraically closed. If C is defined by equations ^Cif^iyX'^ X[^ ■■• = and ^ is a 
power of p, then C^^^ is the curve defined by the equations ^cf^i^...X'Q X[^ ■ ■• = 0, and the 

Frobenius map Fig :C ^ C^'^^ sends the point (ao '■ a\ : ■ • • ) to (Aq : a[: ■■■). Note that if 
C is defined over F^, so that the equations can be chosen to have coefficients c/q/j... in F^, 
then C = C and the Frobenius map is a map from C to itself. 

Recall that a nonconstant morphism a : C — ^ C ' of curves defines an inclusion a* :k{C') 
k{C) of function fields, and that the degree of a is defined to be [k{C) : a*k{C')\. The map 
a is said to be separable or purely inseparable according as /:(C) is a separable or purely 
inseparable extension of a*k{C'). If the separable degree of k{C) over a*k{C') is m, then 
the map C{k) —>■ C (k) h m : \ except on a finite set (assuming k to be algebraically closed). 
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Proposition 10.2 The Frobenius map flq .C —>■ C*^*-* is purely inseparable of degree q, 
and every purely inseparable map (p:C —>■ C of degree q (of complete nonsingular curves) 
factors as 

Proof. Note that 

n*k{C) = kiC^'i^) = kiCy = {a'^\ae k{C)} 

It follows that k{C) is purely inseparable of degree q over k{Cy , and that this statement 
uniquely determines k{C)'^. The last sentence is obvious when k(C) = k(T) (field of 
rational functions in T), and the general case follows because k(C) is a separable extension 
of such a field k(T). □ 

Brief review of the points of order p on elliptic curves 

Let E be an elliptic curve over an algebraically closed field k. The map p:E ^ E (multipli- 
cation by p) is of degree p-^. If k has characteristic zero, then the map is separable, which 
implies that its kernel has order p-^. If k has characteristic p, the map is never separable: 
either it is purely inseparable (and so E has no points of order p) or its separable and 
inseparable degrees ai^e p (and so E has p points of order dividing p). In the first case, 
(10.2) tells us that multiplication by p factors as 

E X E. 

Hence this case occurs only when E ^ E^P'\ i.e., when j(E) = j(E'<P = j{E)P . Thus 
if E has no points of order p, then j(E) e ¥p2. 

The Eichler-Shimura theorem 

The curve Xo{N) is defined over Q and the Hecke correspondence T(p) is defined over 
some number field K. For almost all primes p \ N, Xo(N) will reduce to a nonsingular 
curve Xo(N).^ For such a prime p, the correspondence T(p) defines a correspondence 
f(p) onXo(N). 

Theorem 10.3 For a prime p where Xo{N) has good reduction, 

fp = np + n'p 

(equality in the ring A{X{){N)) of correspondences on Xq{N) over the algebraic closure F 
of¥p; here Wp is the transpose of Hp). 

Proof. We show that they agree as many-valued maps on an open subset of Xii{N). 

Over we have the following description of Tp (see above): a homomorphism of 
elliptic curves a:E E' with cyclic kernel of order A'^ defines a point (j(E), j{E')) on 
Xq(N); let 5*0, . . . , 5*^ be the subgroups of order p in E; then 

Tp{j(E),j{E')) = {(j(Ei)J(E[))} 

■'in fact, it is known that Xq(N) has good reduction for all primes p \ N, but this is hard to prove. It is easy 
to see that Xo(N) does not have good reduction at primes dividing A'. 
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where £",■ = E/Si and — E'/a(Si). 

Consider a point P on Xo(N) with coordinates in F. Ignoring a finite number of points 
of Xo(N), we can suppose P e Yo(N) and hence is of the form (j(E),j(E')) for some 
map a.: E E' . Moreover, we can suppose that E has p points of order dividing p. 

Let a: £■ ^ be a lifting of a to Q^'. The reduction map Ep{Qf) £p(F^') has a 
kernel of order p. Number the subgroups of order in £" so that Sq is the kernel of this map. 
Then each Si, i ^ 0, maps to a subgroup of order p in E. 

The map p:E ^ E factors as 

E -> E/Si E. 

When z = 0, ^ is a purely inseparable map of degree p (it is the reduction of the map 
E — ^ E/So — it therefore has degree p and has zero (visible) kernel), and so ^ must be 
separable of degree p (we are assuming E has p points of order dividing p). Proposition 10.2 
shows that there is an isomorphism E^^^ E / So- Similarly E'^^^ E' / So- Therefore 

(;(^o),;(^o)) = UiE^''^)J(E'^''^)) = UiEVjiEY) = np{j{E)J{E')). 

When i j^O, (f is separable (its kernel is the reduction of 5, ), and so xj/ is purely inseparable. 
Therefore E E^^\ and similarly E' E'-^p\ Therefore 

ijiEi)^P\jiEl)^P^) = (jiE)JiE')). 

Hence 

{(J(Ei),j(E',))\i = 1,2,. -.^p} 

is the inverse image of Up, i.e., it is n'p{j{E), j{E')). This completes the proof of the 
theorem. □ 
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11 Curves and their Zeta Functions 

We begin by reviewing the theory of the zeta functions of curves over Q; then we explain 
the relation between the various representations of the ring of correspondences; finally we 
explain the implications of the Eichler-Shimura theorem for the zeta functions of the curves 
Zo(A'^) and elliptic curves; in particular, we state the conjecture of Taniyama-Weil, and 
briefly indicate how it impUes Fermat's last theorem. 

Two elementary results 

We begin with two results from linear algebra that will be needed later. 

Proposition 11.1 Let A be a free Z-module of finite rank, and leta:A^ Abea Z-linear 
map with nonzero determinant. Then the kernel of the map 

a:(A^q)/A^(A^q)/A 

defined by a has order \ det(Q!) | . 

Proof. Consider the commutative diagram: 

> A > A<S>Q > {A<S>Q)/A > 

0!®1 

> A > A<SiQ > (A<SiQ)/A > 0. 

Because det(o;) ^ 0, the middle vertical map is an isomorphism. Therefore the snake lemma 
gives an isomorphism 

Ker(5) Coker(a) . 

There exist bases e = {ei, . . . ,6^} and / = {/i,...,/^} for A such that a(ei) = cifi, 
Ci € Z, for all / (apply ANT 2.44 to both the rows and columns of some matrix for a). Now 
#Coker(a!) = |ci •••c,nl = |det(a!)|. □ 

Let F be a real vector space. To give the structure of a complex vector space on V 
compatible with its real structure amounts to giving an M-linear map J:V such that 
= —\. Such a map extends by linearity to C (8)k V, and C (8)m V splits into a direct sum 

with (resp. V~) the +1 (resp. —1) eigenspace of /. 
Proposition 1 1 .2 (a) The map 

vh^l^v „ project , 

(y,J) ^C(8)mF-5-^F+ 



is an isomorphism of complex vector spaces, 
(b) The map 

z(8>tii-^z(8>t;:F(8>RC^F(8>MC 
is an R-Unear involution of V (8>r C interchanging F"*" and V' 
Proof. Left as an easy exercise. 
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Corollary 11. 3 Let a be a C-linear endomoqjhism of V. Write A for the matrix of 
a regarded as an M.-linear endomorphism of V , and Ai for the matrix of a as a C-linear 
endomorphism of V . Then 

A-^ Ai®Ai. 

(By this I mean that the matrix A is similar to the matrix 

Proof. Follows immediately from the above Proposition. □ 

For V = C and the map "multiplication by a = a + ib", the statement becomes, 

/a -h\ fa + ih \ 
\b a ) \ a — ibj' 

which is obviously true because the two matrices are semisimple and have the same trace 
and determinant. 




The zeta function of a curve over a finite field 

The next theorem summarizes what is known. 

Theorem 1 1.4 Let C he a complete nonsingular curve of genus g over Fg. Let N„ be 
the number of points of C with coordinates in ¥qn. Then there exist algebraic integers 
o; 1 , a2 , ■ ■ • , 0C2g (independent ofn) such that 

Nn = l+q"-J2<-^ (16) 
/ = i 

moreover, the numbers q/at are a permutation of the at, and for each i, |q;,- | = q^l-^ . 

All but the last of these assertions follow in a straightforward way from the Riemann- 
Roch theorem (see M. Eichler, Introduction to the Theory of Algebraic Numbers and 
Function, Academic Press, 1966, V 5.1). The last is the famous "Riemann hypothesis" for 
curves, proved in this case by Weil in the 1940s. 

Define Z(C, /) to be the power series with rational coefficients such that 

oo 

\ogZ{C,t) = Y,Nnt"/n. 

n=l 

Then (16) is equivalent to the formula 

(l-aiO---(l-a«0 



Z(C,0 



(1-0(1-^0 



(because — log(l — aZ) = 'Y^a"t"' / n). 

Define i^{C,s) = Z(C,q~^). Then the "Riemann hypothesis" is equivalent to ^(C,s) 
having all its zeros on the line !)i (5) = 1/2, whence its name. One can show that 



where Nx is the number of elements in the residue field at x, and so the definition of ^(C, s) 
is quite similar to that of ^(Q,^). 
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The zeta function of a curve over Q 

Let C be a complete nonsingular curve over Q. For all but finitely many primes p, the 
reduction C{p) of C modulo p will be a complete nonsingular curve over ¥ p. We call the 
primes for which this is true the "good primes" for C and the remainder the "bad primes". 
We set 

p 

where i^p{C,s) is the zeta function of C{p) when p is, & good prime and is as defined in 
(Serre, Seminaire DPP 1969/70; Oeuvres, Vol II, pp 581-592) when p is a bad prime. 

On comparing the expansion of t,{C,s) as a Dirichlet series with '^n~^ and using the 
Riemann hypothesis (1 1.4), one finds that t,{C,s) converges for ^{s) > 3/2. It is conjectured 
that it can be analytically continued to the entire complex plane except for simple poles at 
the negative integers, and that it satisfies a functional equation relating t,{s) to t,(2 — s). Note 
that we can write 

L{C,s) 

where 



^ il-ai{p)p ')---(l-a2g(p)p ') 

For an elliptic curve E over Q, there is a pleasant geometric definition of the factors of 
L(E,s) at the bad primes. Choose a Weierstrass minimal model for E, and reduce it mod p. 
If E(p) has a node at which each of the two tangents are rational over F^, then the factor is 
(1 — p~^)~^; if E(p) has a node at which the tangents are not separately rational over 
(this means that the tangent cone is a homogeneous polynomial of degree two variables with 
coefficients in F^ that does not factor over ¥p), then the factor is (1 + p~^)~^; if E(p) has 
a cusp, then the factor is 1. 

The geometric conductor of E is defined to be 

N = Y\p^'' 
p 

where = if £ has good reduction at /p = 1 if E(p) has a node as its only singularity, 
and > 2 if E(p) has a cusp (with equality unless p = 2,3). Write 

A(s) = (2n)-'r(s)L(E,s). 

Then it is conjectured that A{s) can be analytically continued to the entire complex plane as 
a holomorphic function, and satisfies the functional equation: 

A(s) = ±N^~'A(2-s). 

More generally, let m > be a prime not dividing A'^ and let x be primitive character of 
(Z/mZ)^. If 

L(E,s) = J2cnn-', 

we define 
and 

Ax(E,s) = (m/2jtyr(s)Lx(E,s). 
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It is conjectured that A^{E,s) can be analytically continued to the whole complex plane as 
a holomorphic function, and that it satisfies the functional equation 

A,{E,s) = ±(g(x)x(-N)/g(x))N'-'A^(E,2-s} 

where 

m 

n = l 

Review of elliptic curves 

(See also Milne 2006.) Let E be an elliptic curve over an algebraically closed field k, and let 
A = End(£'). Then Q (8)z A is Q, an imaginary quadratic field, or a quaternion algebra over 
Q (the last case only occurs when k has characteristic p and then only for supersingular 
elliptic curves). 

Because E has genus 1, the map "^nilPi] :Div*'(£') — > E{k) defines an 

isomorphism J{k) E(k). 

Here A is the full ring of correspondences of E. Certainly, any element of A can be 
regarded as a correspondence on E. Conversely a correspondence 

E ^ E 

defines a map E(k) E(k), and it is easy to see that this is regular. 
There are three natural representations of A. 

o Let W = TgtQ(£), the tangent space to E at 0. This is a one-dimensional vector space 
over k. Since every element a of ^ fixes 0, a defines an endomorphism da of W. We 
therefore obtain a homomorphism p:A^ End(W). 

o For any prime £ ^ char(/:), the Tate module of £ is a free -module of rank 2. 
We obtain a homomorphism pi.A^ End(7££). 

o When k = C, Hi (E,Ij) is a free Z-module of rank 2, and we obtain a homomorphism 
PB.A-^End(Hi(E,Z)). 

Proposition 11.5 When k = C, 

PB<SiI^l~P£, PB<8>C~p©p. 

(By this I mean that they are isomorphic as representations; from a more down-to-earth point 
of view, this means that if we choose bases for the various modules, then the matrix (pb(q!)) 
is similar to (pi(a)) and to (^''^^^ j for all a € A.) 

Proof. Write E = C/A. Then C is the universal covering space of E and A is the group 
of covering transformations. Therefore A — ni(E,0). From algebraic topology, we know 
that Hi is the maximal abelian quotient of ni, and so (in this case), 

Hi{E,'E) ~ ni{E,Gi) ~ A (canonical isomorphisms). 

The map E defines an isomorphism C TgtQ(£). But 71 is a lattice in C (regarded 
as a real vector space), which means that the canonical map A (8>z M ^ C is an isomorphism. 
Now the relation Pb ^ p® p follows from (11.3). 
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Next, note that the group of points of order on E, E^n , is equal to I ^ A/ A. There 
are canonical isomorphisms 

A^-iiZ/l^Z) = A/l^A^—^r^A/A = E^N. 

When we pass to the inverse hmit, these isomorphisms give an isomorphism A^'L^'^TiE .n 

Remark 11.6 There is yet another representation of A. Let Q^{E) be the space of 
holomorphic differentials on It is a one-dimensional space over k. Moreover, there is a 
canonical pairing 

Q\E)^ Tgt^(E)^k. 

This is nondegenerate. Therefore the representation of A on ^2^{E) is the transpose of the 
representation on TgtgCE). Since both representations are one-dimensional, this means that 
they are equal. 

Proposition 11.7 For every nonzero endomorphism a of E, the degree of a is equal to 
det(pia). 

Proof. Suppose first that = C, so that we can identify E{C) with C/A. Then E(C) tors = 
(A(SiQ)/ A, and (11.1) shows that the kernel of the map E{C) tors E (C) tors defined by a 
is finite and has order equal to det(pB(a)). But the order of the kernel is deg(o;) and (11.5) 
shows that det(pB(a)) = det(p£(o!)). 

For k of characteristic zero, the statement follows from the case k = C For a proof for 
an arbitrary ^ in a more general setting, see my notes on Abelian Varieties, I, 10.20. □ 

Corollary 11.8 Let E be an elliptic curve over ¥p ; then the numbers a i and ai occur- 
ring in (11.4)are the eigenvalues of Tip acting on TgE foralll ^ p. 

— def 

Proof. The elements of E{¥q) are exactly the elements of E{¥ p) that are fixed by Fig = 
rjp, i.e., E(¥g) is the kernel of the endomorphism 77^ — 1. This endomorphism is separable 
(rjp obviously acts as zero on the tangent space), and so 

Nn=dcg(n;-i) 

= det{pdn"p-l)) 
= «-l)(a^-l) 

= q-ai-a2 + l. □ 

We need one last fact. 

Proposition 1 1.9 Let a' be the transpose of the endomorphism a of E; then pi(a') is 
the transpose of pi(a). 

The zeta function of Xo(N): case of genus 1 

When is one of the integers 11, 14, 15, 17, 19, 20, 21, 24, 27, 32, 36, or, 49, the curve 
Xo(N} has genus 1. Recall (p90) that the number of cusps'* of FoiN) is 

J2 <p(gcd(d,N/d)). 

d\N,d>0 

^For a description of the cusps on Xo{N) and their fields of rationality, see Ogg, Rational points on certain 
elliptic modular curves, Proc. Symp. P Math, 24, AMS, 1973, 221-231. 
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If A'^ is prime, then there are two cusps, and / oo, and they are both rational over Q. If A'^ is 
one of the above values, and we take / oo to be the zero element of XoiN), then it becomes 
an elliptic curve over Q. 

Lemma 11.10 There is a natural one-to-one correspondence between the cusp forms of 
weight 2 for FoiN) and the holomorphic differential forms Xo(N) (overC). 

Proof. We know that f \-> fdz gives a one-to-one correspondence between the meromor- 
phic modular forms of weight 2 for Fq^N) and the meromorphic differentials on Xq{N), but 
Lemma 4. 1 1 shows that the cusp forms correspond to the holomorphic differential forms. □ 

Assume that Xq{N) has genus one. Let ct) be a holomorphic differential on X{){N); 
when we pull it back to H and write it / {z)dz, we obtain a cusp form / (z) for FoiN) of 
weight 2. It is automatically an eigenform for T(p) all p \ N, and we assume that it is 
normalized so that f(z) = with ai = 1. Then T(p) ■ f = Upf. One can show that 

Up is real. 

Now consider Xq{N), the reduction of X{){N) modulo p. Here we have endomorphisms 
rip and n'p, and FIp o Hp = deg(/7p) = p. Therefore 

(h - pdnp)T){h - pdn'p)T) = i2- (pdHp + n'p))T + pT^. 

According to the Eichler-Shimura theorem, we can replace Flp -\- fJp by T(p), and since 
the ^-adic representation doesn't change when we reduce modulo p, we can replace T(p) 
by T(p). The right hand side becomes 

Now take determinants, noting that Hp and Hp, being transposes, have the same character- 
istic polynomial. We get that 

{\-apT + pT^f = det{l-npTf. 

On taking square roots, we conclude that 

{l-apT + pT^) = detil-npT) = {l-apT)(l-apT). 

On replacing T with p~^ in this equation, we obtain the equality of the /i-factors of the 
Euler products for the Mellin transform of /(z) and of L(Xo(N),s). We have therefore 
proved the following theorem. 

Theorem 11.11 The zeta function of Xo(N) (as a curve over Q) is, up to a finite number 
of factors, the Mellin transform of f(z). 

Corollary 11.12 The strong Hasse-Weil conjecture (see below) is true for Xo(N). 
Proof. Apply Theorem 9.5. □ 
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Review of the theory of curves 

We repeat the above discussion with E replaced by a general (projective nonsingular) curve 
C. Proofs can be found (at least when the ground field is C) in Griffiths 1989. Let C be a 
complete nonsingular curve over an algebraically closed field k. Attached to C there is an 
abehan variety J , called the Jacobian variety of C such that 

J{k) = Div^(C)/{ principal divisors}. 

In the case that C is an elliptic curve, J = C, i.e., an elliptic curve is its own Jacobian. 

When A: = C it is easy to define /, at least as a complex torus. As we have already 
mentioned, the Riemann-Roch theorem shows that the holomorphic differentials Q^{C) on 
C form a vector space over k of dimension g = genus of C . 

Now assume k = €,. The map 

Hi{C,'L)^Q'^{Cy, y^ico^ I CO), 

Jy 

identifies //i(C,Z) with a lattice in ^2^(C)^ (linear dual to the vector space ^2^(C)). 
Therefore we have a g-dimensional complex torus ^2^(C)^///i(C,Z). One proves that 
there is a unique abelian variety / over C such that 7(C) = ^2^(C)^///i(C,Z). (Recall 
that not every compact complex manifold of dimension > 1 arises from an algebraic variety.) 
We next recall two very famous theorems. Fix a point P € C . 

Abel's Theorem: Let Pi , . . . , and Qi,...,Qr be elements of C(C); then 
there is a meromorphic function on C(C) with its poles at the P, and its zeros 
at the Qi if and only if, for any paths Yi from P to P, and paths y- from P to 
Qi, there exists a y in Hi(C(C),Z) such that 



/ ^2 I — I all ft). 

, Jyi . , Jy'. Jy 



Jacobi Inversion Formula: For any linear mapping /:^2^(C) C, there 
exist g points Pi , . . . , P^ in C(C) and paths y\,...,yg from P to Pi such that 
1(0)) = J2Iyi « for all (o e Q\C). 
These two statements combine to show that there is an isomorphism: 

^«,P( ^ft> j •Div*^(C)/{ principal divisors} 7(C). 

(The yi are paths from P to P, .) The construction of J is much more difficult over a general 
field k. (See my second article in: Arithmetic Geometry, eds. G. Cornell and Silverman, 
Springer, 1986.) 

The ring of correspondences ^ of C can be identified with the endomorphism ring of J, 
i.e., with the ring of regular maps a.J^J such that a(0) = 0. 
Again, there are three representations of A. 

o There is a representation p of .4 on Tgto(/) = ^2^(C)^. This is a vector space of 
dimension g over the ground field k. 

o For every I ^ char(^), there is a representation on the Tate module Pg (/) = Um Jin (k). 
This is a free -module of rank 2g. 
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o When k — C, there is a representation on Hi (C, Z). This is a free Z-module of rank 
2. 

Proposition 11.13 When k = C, 

Proof. This can be proved exactly as in the case of an elliptic curve. □ 

The rest of the results for elliptic curves extend in an obvious way to a curve C of genus 
g and its Jacobian variety J(C). 

The zeta function of Xo(N): general case 

Exactly as in the case of genus 1, the Eichler-Shimura theorem implies the following result. 

Theorem 11.14 Let f\,f2. fg be a basis for the cusp forms of degree 2 for FoiN), 
chosen to be normalized eigenforms for the Hecke operators T(p) for p prime to N . Then, 
apart from the factors corresponding to a finite number of primes, the zeta function of Xo(N) 
is equal to the product of the Mellin transforms of the fj . 

Theorem 11.15 Let f be a cusp form of weight 2, which is a normalized eigenform for 
the Hecke operators, and write f = '^a„q". Then for all primes p \ N , \ap\ < 2p^l^. 

Proof. In the course of the proof of the theorem, one finds that Up =a + a where a occurs 
in the zeta function of the reduction of Xq{N) at p. Thus this follows from the Riemann 
hypothesis. □ 

Remark 11.16 As mentioned earlier (p58), Deligne has proved the analogue of Theorem 
11.15 for all weights: let / be a cusp form of weight 2k for FoiN) and assume / is an 
eigenform for all the T(p) with p a prime not dividing A'^ and that / is "new" (see below); 
write / = YlT '^n^" withai = 1; then 

\ap\<2p^''-'^^, 

for all p not dividing A'^. The proof identifies the eigenvalues of the Hecke operator with 
sums of eigenvalues of Frobenius endomorphisms acting on the etale cohomology of a power 
of the universal elliptic curve; thus the inequality follows from the Riemann hypothesis 
for such varieties. See Deligne, Sem. Bourbaki, Fev. 1969. In fact, Deligne's paper Weil 
II simplifies the proof (for a few hints concerning this, see E. Freitag and R. Kiehl, Etale 
Cohomology and the Weil Conjecture, p278). 

The Conjecture of Taniyama and Weil 

Let E be an elliptic curve over Q. Let A'^ be its geometric conductor. It has an L-series 

oo 

L(E,s) — a„n~^ . 

n = l 

For any prime m not dividing A^, and primitive character /: (Z/mZ)^ — ^ C^, let 

oo 

A^(E,s) = N'l^ r{s) J2 cinX{n)n-'. 

n=l 
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Conjecture 11.17 (Strong Hasse-Weil conjecture) For all m prime to N, and 
all primitive Dirichlet characters x< ^x(^>^) ^ analytic continuation of C, bounded in 
vertical strips, satisfying the functional equation 

A^(E,s) = ±ig{x)xi-N)/gix))N'-'A^(E,2-s) 

where 

m 
n=l 

An eUiptic curve E over Q is said to be modular if there is a nonconstant map Xo(N)—^ 
E (defined over Q). 

Remark 11.18 Let C be a complete nonsingular curve, and fix a rational point P on C 
(assumed to exist). Then there is a canonical map (pp:C ^ J(C) sending P to 0, and the 
map is universal: for every abelian variety A and regular map (p:C ^ A sending P to 0, 
there is a unique map ifr: J{C) A such that xj/ o(pp — (p. Thus to say that E is modular 
means that there is a surjective homomorphism Jq{N) E . 

Theorem 11.19 An elliptic curve E over Q is modular if and only if it satisfies the strong 
Hasse-Weil conjecture (and in fact, there is a map Xo(N) ^ E with A'^ equal to the geometric 
conductor of E ). 

Proof. Suppose E is modular, and let co be the Neron differential on E. The pull-back of 
CO to Xo(N) can be written / (z)dz with / (z) a cusp form of weight 2 for /o(A^), and the 
Eichler-Shimura theorem shows that A{E,s) is the Melhn transform of /. (Actually, it is 
not quite this simple...) 

Conversely, suppose E satisfies the strong Hasse-Weil conjecture. Then according to 
Weil's theorem, A{E,s) is the Mellin transform of a cusp form /. The cusp form has 
rational Fourier coefficients, and the next proposition shows that there is a quotient E' of 
JoiN) whose L-series is the Mellin transform of / ; thus we have found a modular elliptic 
curve having the same zeta function as E, and a theorem of Faltings then shows that there is 
an isogeny E' ^ E. □ 

Theorem 1 1.20 (Faltings 1983) Let E and E' be elliptic curves overQ. If^(E,s) = 
^(E',s) then E is isogenous to E' . 

Proof. See his paper proving Mordell's conjecture (Invent. Math. 1983). □ 

For any M dividing N , we have a canonical map Xo{N) Xo{M), and hence a 
canonical map Jo(N) Jo(M). The intersection of the kernels of these maps is the "new" 
part of Jo(N), which is denoted /^"'^^(iV). 

Similarly, it is possible to define a subspace Sq^'^(N) of new cusp forms of weight 2. 

Proposition 11.21 There is a one-to-one correspondence between the elliptic curves E 
over Q that admit a surjective map Xo(N) E, but no surjective map Xq{M) E with 
M < N , and newforms for FoiN) that are eigenforms with rational eigenvalues. 

Proof. Given a "new" form /(z) = "^anq" as in the Proposition, we define an elliptic 
curve E equal to the intersection of the kernels of the endomorphisms T(p) — ap acting 
on J{Xo(N)). Some quotient of £ by a finite subgroup will be the modular elliptic curve 
sought. □ 
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Conjecture 11 .22 (Taniyama-Weil) Let E be an elliptic curve over Q with geometric 
conductor A'^. Then there is a nonconstant map Xq{N) —>■ E; in particular, every elliptic 
curve over Q is a modular elliptic curve. 

We have proved the following. 

Theorem 11.23 The strong Hasse-Weil conjecture for elliptic curves over Q is equivalent 
to the Taniyama-Weil conjecture. 

Conjecture 1 1.22 was suggested (a little vaguely) by Taniyama^ in 1955, and promoted 
by Shimura. Weil proved Theorem 11.23, which gave the first compelling evidence for 
the conjecture, and he added the condition that the A'^ in Xo(N) be equal to the geometric 
conductor of E, which allowed the conjecture to be tested numerically. 

Notes 

There is a vast literature on the above questions. The best introduction to it is: Swinnerton- 
Dyer, H. P. F.; Birch, B. J. Elliptic curves and modular functions. Modular functions of one 
variable, IV (Proc. Internat. Summer School, Univ. Antwerp, Antwerp, 1972), pp. 2-32. 
Lecture Notes in Math., Vol. 476, Springer, Berlin, 1975. See also: Manin, Parabolic points 
and zeta-functions of modular curves. Math. USSR 6 (1972), 19-64. 

Fermat's last theorem 

Theorem 1 1 .24 The Taniyama conjecture implies Fermat's last theorem. 

Idea: It is clear that the Taniyama conjecture restricts the number of elliptic curves 
over Q that there can be with small conductor. For example, Xo{N) has genus zero for 
A'^ = 1,2,3, ...,10, 12, 13, 16, 18,25 and so for these values, the Taniyama-Weil conjecture 
implies that there can be no elliptic curve with this conductor. (Tate showed a long time ago 
that there is no elliptic curve over Q with conductor 1, that is, with good reduction at every 
prime.) 

More precisely, one proves the following: 
Theorem 1 1.25 Let p be a prime > 2, and suppose that 

aP-bP =cP 

with a,b,c all nonzero integers and gcd(a,b,c) = 1. Then the elliptic curve 

E: = X(X-aP)(X + bP) 
is not a modular elliptic curve. 

Proof. We can assume that p > 163; moreover that 2\b and a = 3 mod 3. An easy 
calculation shows that the curve has bad reduction exactly at the primes p dividing abc, and 
at each such prime the reduced curve has a node. Thus the geometric conductor is a product 
of the primes dividing abc. 

^Taniyama was a very brilliant Japanese mathematician who was one of the main founders of the theory of 
complex multiplication of abelian varietes of dimension > 1. He killed himself in late 1958, shortly after his 
31st birthday. 
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Suppose that £" is a Weil curve. There is a weight 2 cusp form for ro(N) with integral 
^-expansion, and Ribet proves that there is a cusp form of weight 2 for /o(2) such that 
f = f modulo I. But Xo(2) has genus zero, and so there are no cusp forms of weight 2.n 

Remark 1 1.26 Ribet's proof is very intricate; it involves a delicate interplay between 
three primes l,p, and q, which is one more than most of us can keep track of (Ribet, On 
modular representations of Gal(Q/Q) arising from modular forms. Invent. Math 100 (1990), 
431^76). As far as I know, the idea of using the elliptic curve in (11.25) to attempt to 
prove Fermat's last theorem is due to G. Frey. He has published many talks about it, see 
for example, Frey, Links between solutions of A — B — C and elliptic curves, in Number 
Theory, Ulm 1987, (ed. H. Schlickewei and Wirsing), SLN 1380. 

Application to the conjecture of Birch and Swinnerton-Dyer 

Recall (Milne 2006, IV 10) that, for an elliptic curve E over Q, the conjecture of Birch and 
Swinnerton-Dyer predicts that 



where r = rank(£'(Q)), ^2 = /^^i^) \(jo\ where cv is the Neron differential on E, the product 
of the Cp is over the bad primes, TS is the Tate-Shafarevich group of E, and R{E/Q) is the 
discriminant of the height pairing. 

Now suppose £■ is a modular elliptic curve. Put the equation for E in Weierstrass minimal 
form, and let co = dx/(2y + aix + as) be the Neron differential. Assume a*co = fdz, 
for f(z) a newfonn for FoiN). Then L{E,s) is the Mellin transform of f(z). Write 
f(z) = c(q + a2q^ + . )q~^dq, where c is a positive rational number. Conjecturally c = 1, 
and so I drop it. 

Assume that ioo maps to e £. Then q is real for z on the imaginary axis between 
and ioo. Therefore j{z) and j(Nz) are real, and, as we explained (end of Section 8) this 
means that the image of the imaginary axis in Xo(N)(C) is in Xo(N)(W), i.e., the points in 
the image of the imaginary axis have real coordinates. 

The Mellin transform formula (cf. 9.2) implies that 



Intuitively at least, M is the winding number of the map from the imaginary axis from 
to /oo onto £■(!£). The image of the point in Xo(N) is known to be a point of finite 
order, and this implies that the winding number is a rational number. Thus, for a modular 
curve (suitably normalized), the conjecture of Birch and Swinnerton-Dyer can be restated as 
follows. 



limis-iyiiE^s) 



^2^pCp[TS(£/Q)]i?(£/Q) 

[£(Q)tors]2 




Define M by the equation 




Conjecture 1 1.27 (Birch and Swinnerton-Dyer) (a) The group E{Q) is infinite 
if and only if M = 0. 

(b) If M ^ 0, then M [£(Q)]2 = [ TS(£/Q)] Up Cp- 
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Remark 1 1 .28 Some remarkable results have been obtained in this context by Kolyvagin 
and others. (See: Rubin, The work of Kolyvagin on the arithmetic of elliptic curves, SLN 
1399, MR 90h: 14001), and the papers of Kolyvagin.) 

More details can be found in the article of Birch and Swinnerton-Dyer mentioned 

above. Winding numbers and the mysterious c are discussed in Mazur and Swinnerton-Dyer, 
Inventiones math., 25, 1-61, 1974. See also the article of Manin mentioned above and Milne 
2006. 



124 



II. The Algebro-Geometric Theory 



12 Complex Multiplication for Elliptic Curves Q 

The theory of complex multiplication is not only 
the most beautiful part of mathematics but also of 
the whole of science. 
D. Hilbert. 

It was known to Gauss that Q[^«] is an abelian extension of Q. Towards the end of the 
1840's Kronecker had the idea that cyclotomic fields, and their subfields, exhaust the abelian 
extensions of Q, and furthermore, that every abelian extension of a quadratic imaginary 
number field E = Q[\/—d] is contained in the extension given by adjoining to E roots of 1 
and certain special values of the modular function j . Many years later, he was to refer to 
this idea as the most cherished dream of his youth (mein liebster Jugendtraum) (Kronecker, 
Werke, V, p435).'^ 

Abelian extensions of Q 

Let Q '^^'^ = U it is a subfield of the maximal abehan extension Q^'' of Q. 

Theorem 12. 1 (Kronecker- Weber) The field Q '^^'^ = Q'''^ 
The proof has two steps. 

Elementary part. Note that there is a homomorphism 

/:Gal(Q[t„]/Q) ^ (Z/nZ)^ ai; = 

which is obviously injective. Proving that it is surjective is equivalent to proving that the 
cyclotomic polynomial 

(m.n) = \ 

is irreducible in Q[X], or that Gal(Q[^„]/Q) acts transitively on the primitive «th roots of 1. 
One way of doing this is to look modulo p, and exploit the Frobenius map (see FT 5.10). 

Application of class field theory. For every abelian extension F of Q, class field theory 
provides us with a surjective homomorphism (the Artin map) 

0:I^Gal(F/Q) 

where I is the group of ideles of Q (see CFT V, 4). When we pass to the inverse limit over 
all F's, then we obtain an exact sequence 

1 (Q^ ^ I ^ Gal(Q^VQ) 1 

where M"*" = {r e M | r > 0}, and the bar denotes the closure. 
Consider the homomorphisms 

I Gal(Q ^VQ) Gal(Q "^VQ) ^ hm(Z/mZ)^ = . 

*For a history of complex multiplication for elliptic curves, see: Schappacher, Norbert, On the history of 
Hubert's twelfth problem: a comedy of errors. Materiaux pour I'histoire des mathematiques au XX'^ siecle (Nice, 
1996), 243-273, Semin. Congr., 3, Soc. Math. France, Paris, 1998. 
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All maps are surjective. In order to show that the middle map is an isomorphism, we have to 
prove that the kernel of I ^ Z'^ is (Q'^ -1^+)"; it clearly contains (<Q^ •M+)~. 

Note that Z = ]^ , and that Z^ = ]~[ Z^ . There is therefore a canonical embedding 

and to complete the proof of the theorem, it suffices to show: 

(i) the composite Z'^ I ^ Z'^ is the identity map; 

(ii) (Q''-M+)--j(Z'') = L 

Assume these statements, and let a el. Then (ii) says that a = a ■i{z) with a e 
(Q^ ■M+)~ and z e Z^, and (i) shows that <p(a ■z) = z. Thus, if a e Ker(^), then z = 1, 
andot e (Q''-M+)-. 

The proofs of (i) and (ii) are left as an exercise (see CFT, V, 5.9). 

Alternative: Find the kernel of 4>: (A^/Q^) Gal(Q[^„]/Q), and show that every open 
subgroup of finite index contains such a subgroup. 

Alternative: For a proof using only local (i.e., not global) class field theory, see CFT, I 
4.16. 

Orders in K 

Let ^ be a quadratic imaginary number field. An order of ^ is a subring R containing Z 
and free of rank 2 over Z. Clearly every element of R is integral over Z, and so i? C Ok 
(ring of integers in K). Thus Ok is the unique maximal order. 

Proposition 12.2 Let R be an order in K. Then there is a unique integer / > such that 
i? = Z + / • Ok- Conversely, for every integer / > 0, Z + / • Ok is an order in K. 

Proof. Let {l,a} be a Z-basis for Ok, so that Oy^ = Z + Za. Then i? n Za is a subgroup 
of Za, and hence equals Za/ for some positive integer /. Now Z + / Ok C Z + Za/ <ZR. 
Conversely, if m + «a e m,n e Z, then na e R, and so « e /Z. Thus, m + na e 

1. + fa'L<Z'L + fOK- □ 

The number / is called the conductor of R. We often write for Z + / • Ok- 

Proposition 12.3 Let Rhem order in K. The following conditions on an R-submodule 
a of K are equivalent: 

(a) is a projective R-module; 

(b) R = {ae K\a-a(Za}; 

(c) a = X- Ok for some x el (this means that for all primes v of Ok, cl-Ov = x^-Ov)- 
Proof. For (b) =^ (c), see Shimura 1971, (5.4.2), p 122. □ 

Such an /?-submodule of K is called a proper ^-ideal. A proper /?-ideal of the form 
aR, a e K^, is said to be principal. If a and b are two proper /?-ideals, then 

Gi e a, bi e b| 

is again a proper /?-ideal. 

Proposition 12.4 For every order R in K, the proper R-ideals form a group with respect 
to multiplication, with R as the identity element. 

Proof. Shimura 1 97 1 , Proposition 4. 1 1 , p 1 05 . □ 
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The class group Cl(R) is defined to be the quotient of the group of proper /?-ideals by 
the subgroup of principal ideals. When R is the full ring of integers in E, then Cl(R) is the 
usual class group. 

Remark 12.5 The class number of R is 

h(R) ^h.f.{Ol: R^r' ■ n (l - (-) P-') 

piyV \PJ ) 

where h is the class number of Ok, and (— ) = 1,-1,0 according as p splits in K, stays 

prime, or ramifies. (If we write {± 1} for the Galois group of K over Q, then p (^) is 
the reciprocity map.) See Shimura 1971, Exercise 4.12. 



Elliptic curves over C 

For every lattice A in C, the Weierstrass p and p' functions realize C/yl as an elliptic curve 
Ei^A), and every elliptic curve over C arises in this way. If A and A are two lattices, and a 
is an element of C such that aA C A', then [z] [az] is a homomorphism E{^A) E{^A), 
and every homomorphism is of this form; thus 

^om{E{A),E{A)) = {a € C I c yl'}. 

In particular, £'(/!) ^ E{A) if and only if A — aA for some a eC^. 

These statements reduce much of the theory of elliptic curves over C to linear algebra. 
For example, End(£') is either Z or an order i? in a quadratic imaginary field K. Consider 
E = E{Ay, if End(£) ^ Z, then there is an a e C, a ^ Z, such that aA c A, and 

End(E) = {aeC\aAcA}, 

which is an order in Q[a] having yl as a proper ideal. 

When End(£') = i? 7^ Z, we say E has complex multiplication by R. 

Write E = E(A), so that E(C) = C/A. Clearly £„(C), the set of points of order 
dividing n on E, is equal to n~^A/A, and so it is a free Z/«Z-module of rank 2. The 

def 

inverse limit, TgE = lim Egm = \iml~"'A/A = A<SiZi, and so V^E = A<SiQi- 
Algebraicity of j 

When R is an order in a quadratic imaginary field ^ C C, we write Ell(^) for the set of 
isomorphism classes of elliptic curves over C with complex multiplication by R. 

def 

Proposition 12.6 For each proper R-ideal a, E(a) = C/a is an elliptic curve with com- 
plex multiplication by R, and the map a C/a induces a bijection 

C\iR)^El\{R). 
Proof. If is a proper -ideal, then 

End(£'(a)) = {a e C | ao C a} (see above) 
= {a e A" I aa C a} (easy) 
= R (definition of proper -ideal). 

Since E(a ■ a) ^ E(b) we get a well-defined map C\(R) E\\(R). Similar arguments show 
that it is bijective. □ 
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Corollary 12.7 Up to isomorphism, there are only Gnitely many elliptic curves over C 
with complex multiplication by R; in fact there are exactly h{R). 

With an elliptic curve E over C, we can associate its 7 -invariant j{E) e C, and E ^ E' 
if and only if j{E) = j'(E'). For an automorphism a of C, we define oE to be the curve 
obtained by applying o to the coefficients of the equation defining E. Clearly j(oE) = 
oj{E). 

Theorem 12.8 If E has complex multiplication then j(E) is algebraic. 

Proof. Let z e C. If z is algebraic (meaning algebraic over Q), then z has only finitely 
many conjugates, i.e., as a ranges over the automorphisms of C, az ranges over a finite 
set. The converse of this is also true: if z is transcendental, then oz takes on uncountably 
many different values (if z' is any other transcendental number, there is an isomorphism 

Q[z] Q[z'] which can be extended to an automorphism of C). 

Now consider j(E). As a ranges over C, aE ranges over finitely many isomorphism 
classes, and so (yj(E) ranges over a finite set. This shows that j{E) is algebraic. □ 

Corollary 12.9 Let / be the (usual) modular function for r(\), and let z e H be such 
that Q[z] is a quadratic imaginary number Held. Then j(z) is algebraic. 

Proof. The function j is defined so that j(z) — j{E{A)), where A = Z + Zz. Suppose 
Q[z] is a quadratic imaginary number field. Then 

{aeC\ a(Z + Zz) c Z + Zz} 

is an order R in Q[z], and E{A) has complex multiplication by R, from which the statement 
follows. □ 

The integrality of j 

Let E be an elliptic curve over a field k and let /? be an order in a quadratic imaginary 
niraiber field K. When we are given an isomorphism i :R ^ End(£'), we say that E has 
complex multiplication by R (defined over k). Then R and Z^ act on T^E, and therefore 

def 

RiS'z'^l acts on T^E; moreover, K<SiqQi acts on ViE = TiE These actions commute 
with the actions of Gal(^^V k) on the modules. 

Let a be an endomorphism of an elliptic curve E over a field k. Define, 

Tr(Q;) = 1 + deg(o!) - deg(l - a) e Z, 

and define the characteristic polynomial of a to be 

fa(X) = X^-Tr(a)X + degia) e Z[X]. 

Proposition 12.10 (a) The endomorphism faia) of E is zero. 

(b) For all I ^ char(k), fa(X) is the characteristic polynomial of a acting onV^E. 

Proof. Part (b) is proved in Silverman 1986, 2.3, pl34. Part (a) follows from (b), the 
Cayley-Hamilton theorem, and the fact that the End(£') acts faithfully on V^E (Silverman 
1986, 7.4, p92). □ 



C OROLL ARY 12.11 If E has complex multipUcation byRcK, then Vi E is a free KiS>Qi- 
module. 
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Proof. When the ground field k = C, this is obvious because ViE = A <S)zQi, and 
A(SizQe = ®i Q) <^Q'Qe = K <^q Q^. When K (8)q is a field, it is again obvious 
(every module over a field is free). Otherwise KiSiqQi = Ky® where v and w are the 
primes of K lying over p, and we have to see that ViE is isomorphic to the K (8> -module 
Ky © Kyj (rather than for example). But for a € ^, a ^ Q, the proposition shows 
that characteristic polynomial of a acting on V^E is the minimum polynomial of a over K, 
and this implies what we want. □ 

Remark 12.12 In fact T^E is a free R Z^-module (see J-P. Serre and J. Tate, Good 
reduction of abelian varieties, Ann. of Math. 88, 1968, pp 492-517, p502). 

Proposition 12.13 The action ofGalik'^/ k) on Vi E factors through Ki^Qi, i.e., there 
is a homomorphism pf.Gal(k^/ k) ^ (K(^ Qe)^ such that 

Pi(a)-x = ax, alia eGal{k^'/k), xeViA. 

Proof. The action of Ga\.(k^/ k) on VgE commutes with the action R (because we are 
assuming that the action of R is defined over k). Therefore the image ofGal(A:^/A:) Ues m 
EndA-®Q^ {VlE), which equals K Q^, because V^E is free K Q^-module of rank 1. □ 

In particular, we see that the image of pi is abelian, and so the action of Ga^^^V ^) 
factors through Gal(^*/ k) — all the ^"'-torsion points of E are rational over fc* for all m. 

As Gal(/c^'/ k) is compact, Im(pi) C O^, where Oi is the ring of integers in ^0qQ£ (O^ 
is either a complete discrete valuation ring or the product of two such rings). 

Theorem 12.14 Let E be an elliptic curve over a n umber field k having complex multi- 
plication by R overk. Then E has potential good reduction at every prime vofk (i.e., E 
acquires good reduction after a finite extension of k ). 

Proof. Let € be a prime number not divisible by v. According to Silverman 1986, Vn.7.3, 
pi 86, we have to show that the action of the inertia group ly at v onT^A factors through a 
finite quotient. But we know that it factors through the inertia subgroup Jy of Gal(/:*/ k), 
and class field theory tells us that there is a surjective map 

where Oy is the ring of integers in kv Thus we obtain a homomorphism 

O^^Jy^ C AutiTiE), 

where Oi is the ring of integers in ^ . I claim that every homomorphism Oy ^ 
automatically factors through a finite quotient. In fact algebraic number theory shows that 
has a subgroup of finite index which is a pro-/?-group, where p is the prime lying 
under v. Similary, has a subgroup of finite index V which is a pro-£-group. Any map 
from a pro-/) group to a pro-l!-group is zero, and so Ker(?7^ O^) = Ker(J7^ /V), 
which shows that the homomorphism is zero on a subgroup of finite index ofU^. □ 

Corollary 12.15 If E is an elliptic curve over a number field k with complex multipli- 
cation, then j(E) e Ok- 

Proof. An elementary argument shows that, if E has good reduction at v, then j(E) e Oy 
(cf. Silverman 1986, Vn.5.5, pl81). □ 
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Corollary 12.16 Let j be the (usual) modular function for r(l), and letz eW be such 
that Q[z] is a quadratic imaginary number field. Then j{z) is an algebraic integer. 

Remark 12.17 There are analytic proofs of the integrality of j(E), but they are less 
illuminating. 

Statement of the main theorem (first form) 

Let A' be a quadratic imaginary number field, with ring of integers Ok, and let E[1(Ok) be 
the set of isomorphism classes of elliptic curves over C with complex multiplication by Ok- 
For any fractional Oi^: -ideal A in K, we write j(A) for j(C/A). (Thus if A = Za»i + Z(W2 
where z = ct)i/cL>2 lies in the upper half plane, then j(A) = j{z), where j{z) is the standard 
function occurring in the theory of elliptic modular functions.) 

Theorem 12.18 (a) For any elliptic curve E overC with complex multiplication by 

Ok, K[j{E)] is the Hilbert class field K^""^ ofK. 

(b) The group GaliK^'^yK) permutes the set{ j(E) \ E e EI\{Ok)} transitively 

(c) For each prime ideal pofK, Frob{p) (j (A)) = j{A-p~^). 

The proof will occupy the next few subsections. 
The theory of o-isogenies 

Let R be an order in K, and let a be a proper ideal in R. For an elliptic curve E over a field 
k with complex multiplication by /?, we define 

Ker(o) = P) Ker(a: E ^ E). 
aeo 

Note that if a = (ai , . . . , fl„), then Ker(a) = n Ker(a, : E E). Let A be a proper 7?-ideal, 
and consider the elliptic curve E{A) over C. Then • a~Ms also a proper ideal. 

Lemma 12.19 There is a canonical map E(A) E(A-a~^) with kernel Ker(o). 
Proof. Since Ac A- o~\ we can take the map tohe z + A\-^ z + A- a~^. □ 

Proposition 12.20 Let E be an elliptic curve over k with complex multiplication by 
R, and let a be a proper ideal in R. Assume k has characteristic zero. Then there is an 
elliptic curve a ■ E and a homomorphism map (pa'.E^a-E whose kernel is Ker(a). The 
pair (a- E, (pa) has the following universal property: for any homomorphism (p:E ^ E' with 
Ker(^) D Ker(o), there is a unique homomorphism xjf.a- E ^ E' such that xj/ oip^ — cp. 

Proof. When A: = C, we write E = E{A) and take a- E = E(A • a~^). If A: is a field of 
characteristic zero, we define a-E = E(A • a~^) (see Silverman 1986, 4.12, 4.13.2, p78).n 

We want to extend the definition of o- £■ to the case where k need not have characteristic 
zero. For this, we define o- to be the image of the map 

X (flix, ...,a„x): E ^ E", o = (ai, . .. ,a„), 

and (pa to be this map. We call the isogeny cpa'.E a- E (or any isogeny that differs from it 

def 

by an isomorphism) an a-isogeny. The degree of an a-isogeny is N(a) = (Ok '• a)- 

We obtain an action, (o, E) i-^ a- E,of Cl(R) on Ell;;. (R), the set of isomorphism classes 
of elUptic curves over k having complex multipUcation by R. 
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Proposition 12.21 The action of Cl(R) on Ellk{R) makes Ell;t(^) into a principal ho- 
mogeneous space for Cl(R), i.e., for any xq e Ell(R), the map o a • xq: C1(/?) E[l(R) 
is a bijection. 

Proof. When k = C, this is a restatement of an earlier result (before we implicitly took 
xo to be the isomorphism of class of C/R, and considered the map Cl(i?) Ell(i?), 
^ a"^ ■ Xo). We omit the proof of the general case, although this is a key point. □ 

Reduction of elliptic curves 

Let £■ be an elliptic curve over a number field k with good reduction at a prime vofk. For 
simplicity, assume that v does not divide 2 or 3. Then E has an equation 

Y^Z = X^ + aXZ^ + bZ^ 

with coefficients in Oy whose discriminant A is not divisible by pi,. 

Reduction of the tangent space 

Recall that for a curve C defined by an equation F(X, Y) — 0, the tangent space at (a,b) on 
the curve is defined by the equation: 



dF 
dx 



(a,b) 



(Y-b) = 0. 

(.a,b) 



For example, for 

Z = X^ + aXZ^ + bZ^ 
we find that the tangent space to E at (0, 0) is given by the equation 

Z = 0. 

Now take a Weierstrass minimal equation for E over Ov — we can think of the equation as 
defining a curve £ over Ov, and use the same procedure to define the tangent space Tgto(£^) 
at on £ — ^it is an Oy-module. 

Proposition 12.22 The tangent space Tgto(£) atOto£ is a free Oy-module of rank one 
such that 

Tgto(f ) 00, Kv = Tgto{E/Kv), Tgto(f ) k(v) = Tgto(^(i;)) 

where k(v) = Oy/pv ^nd E(v) is the reduced curve. 

Proof. Obvious. □ 

Thus we can identify Tgto(<?) (in a natural way) with a submodule of Tgto(£'), and 
Tgto(£'(i;)) = TgtQ(£^)/m„ ■TgtQ(£"), where is the maximal ideal of 
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Reduction of endomorphisms 

Let a: E ^ E' he a homomorphism of elliptic curves over k, and assume that both E and E' 
have good reduction at a prime vofk. Then a defines a homomorphism a(v): E{v) ^ E' (v) 
of the reduced curves. Moreover, a acts as expected on the tangent spaces and the points of 
finite order. In more detail: 

(a) the map TgtQ(Q;):Tgto(£') TgtQ(£'') maps Tgto(<?) into Tgl^iS'), and induces the 
map TgtQ(a(i;)) on the quotient modules; 

(b) recall (Silverman 1986) that for i ^ char(/<:(i>)) the reduction map defines an isomor- 
phism TiiE) — > Ti(Eo); there is a commutative diagram: 

TiE TtE' 

nE{v) nE'iv). 

It follows from (b) and Proposition 12.10 that a and ao have the same characteristic 
polynomial (hence the same degree). 

Also, we shall need to know that the reduction of an a-isogeny is an o-isogeny (this is 
almost obvious from the definition of an o-isogeny). 

Finally, consider an a-isogeny (p:E^E';it gives rise to a homomorphism 

Tgto(£)^Tgto(£') 

whose kernel is P| Tgto {a), a running through the elements of a (this again is almost obvious 
from the definition of a-isogeny). 

The Frobenius map 

Let E be an elliptic curve over the finite field /c D F^. If £" is defined by 

= X'^ +aX + b, 

then write E^'^^ for the elliptic curve 

72 ^ x'^ + a" X + b'^ . 

Then the Frobenius map Frobg is defined to be 

{x,y)^(x'^,y'^y.E^E^'i\ 

Proposition 12.23 The Frobenius map Frobj, is a purely inseparable isogeny of degree 
p; if (p:E E' is a second purely inseparable isogeny of degree p, then there is an 
isomorphism a: E^^^ E' such that a oFrob^ = cp. 

Proof. This is similar to Silverman 1986, 2.11, p30. We have (Frobp)* (k(E'^P^)) = 
k{E)P , which the unique subfield of k{E) such that k{E) D k{E)P is a purely inseparable 
extension of degree p. □ 

Remark 12.24 There is the following criterion: A homomorphism a : £ E' is separable 
if and only the map it defines on the tangent spaces TgtQ(£') Tgto(£'0 is an isomorphism. 
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Proof of the main theorem 

The group G = Gal(Q^'/ K) acts on EII(i?), and commutes with the action of Cl(i?). Fix an 
xo e E\\(R), and for a eG, define (p(a) e Cl(R) by: 

axo = xo-(p(a). 

One checks directly that (p(a) is independent of the choice of xo, and that f is a. homomor- 
phism. Let L be a finite extension of K such that 

(a) (p factors through GaI(L/Q); 

(b) there is an elliptic curve E defined over L with j -invariant 7(0), some proper ^-ideal 
0. 

Lemma 12.25 There is a set S of prime ideals of K of density one excluding those that 
ramify in L, such that 

q>{<p^) = Cl(p) 

for allp e S ; here (pp e Gal(L / K) is a Frobenius element. 
Proof. Let p be a prime ideal of K such that 

(i) p is unramified in L; 

(ii) E has good reduction at some prime ideal lying over p; 

(iii) p has degree 1, i.e., N(p) = p, a. prime number. 

The set of such p has density one (conditions (i) and (ii) exclude only finitely many 
primes, and it is a standard result (CFT, VI 3.2) that the primes satisfying (iii) have density 
one). 

To prove the equation, we have to show that 

(PpiE) Kip-E. 

We can verify this after reducing mod 

We have a p-isogeny E ^ p - E. When we reduce modulo p, this remains a p-isogeny. 
It is of degree N(p) = p, and by looking at the tangent space, one sees that it is purely 
inseparable. Now (Pp(E) reduces to E^p\ and we can apply Proposition 12.23 to see that 
E^P^ is isomorphic p- E. □ 

We now prove the theorem. Since the Frobenius elements Frobp generate Gai(L/ K), 
we see that cp is surjective; whence (a) of the theorem. Part (b) is just what we proved. 

The main theorem for orders 

(Outline) Let Rf he an order in K. Just as for the maximal order Ok, the ideal class group 
Cl(R) can be identified with a quotient of the idele class group of K, and so class field theory 
shows that there is an abelian extension K f of K such that the Artin reciprocity map defines 
an isomorphism 

(l):Cl(Rf) ^GaliK f/K). 

Of course, when / = I, K f is the Hilbert class field. The field K f is called the ring class 
field. Note that in general €1(7? f) is much bigger than CI(Ok)- 
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The same argument as before shows that if has complex multiplication by , then 
K[j(E f)] is the ring class field for K. Kronecker predicted (I think)^ that K^^ should equal 

def 

K* =Q '^y'^ • K', where K' = [JK(j(E /)) (union over positive integers). Note that 



and so K* is obtained from K by adjoining the special values j(z) of j and the special 
values e2^''"/" of . 

Theorem 12.26 The Galois group Gal(K^'^/K*) is a product of groups of order 2. 
Proof. Examine the kernel of the map Ik Gal(K*/K). □ 

Points of order m 

(Outline) We strengthen the main theorem to take account of the points of finite order. Fix 
an m, and let E be an elliptic curve over C with complex multiplication by Ok- For any 
a e Aut(C) fixing K, there is an isogeny a:E aE, which we may suppose to be of degree 
prime to m. Then a maps E^ into aEm, and we can choose a so that a(x) = ax mod m) 
for all X e TfE(= YlT^E). We know that a will be an o-isogeny for some o, and under our 
assumptions a is relatively prime to m. 

Write Id(m) for the set of ideals in K relatively prime to m, and Cl(77i) for the corre- 
sponding ideal class group. The above construction gives a homomorphism 



Let Km be the abelian extension of K (given by class field theory) with Galois group Cl(m). 

Theorem 12.27 The homomorphism factors through Ga.\{Km/ K), and is the reciprocal 
of the isomorphism given by the Artin reciprocity map. 

Proof. For m = I, this is the original form of the main theorem. A similar argument works 




(union over r e K, r e H), 



Aut(C/^) ^Cl(m). 



in the more general case. 



□ 



Adelic version of the main theorem 



Omitted. 



^Actually, it is not too clear exactly what Kronecker predicted — see the articles of Schappacher. 
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